1 OVERVIEW


Essentially everything of astronomical interest is either part 
of a galaxy, or from a galaxy, or otherwise relevant to the 
origin or evolution of galaxies. Diverse examples are that the
isotopic composition of meteorites provides clues to the history
of star formation billions of years ago, and cosmological
tests for the deceleration of the Universe are strongly affected 
by changes in the luminosities of galaxies during the lookback 
time sampled. The aim of this article is to review some of the 
vital connections that galaxy evolution makes among many 
astronomical phenomena. 

The evolution of galaxies can be broadly divided into three 
areas: dynamical evolution, chemical evolution, and the evo- 
lution of photometric properties. Chemical and photometric 
evolution are the main topics of this review, and dynamical 
processes will be considered only where they directly affect 
the others. The article is intended to be self-contained for 
readers with a general knowledge of astronomy and astro- 
physics but with no special expertise on galaxies. Because 
the background and literature relevant to the evolution of 
stars and gas in galaxies are so extensive, complete coverage 
cannot be attempted. Instead, comprehensive reviews and re- 
cent papers are emphasized in the references; these include 
detailed works in closely related fields, such as galaxy forma- 
tion and nucleosynthesis, that are treated very briefly here. 
The history of the subject has been discussed in several recent 
reviews (e.g. Sandage et al. 1975; Spinrad & Peimbert 1975; 
Trimble 1975; van den Bergh 1975), so here the emphasis will 
be on current ideas. 


This first Section gives an overview of relevant properties 
of galaxies and theoretical ideas on how they formed and 
evolved, followed by an outline of the rest of the article. 


1.1 Stellar Populations and Chemical Composition of 
Galaxies 


Galaxies are made of stars and interstellar matter (ISM), 
with a variety of properties that correlate remarkably with 
the forms of galaxies as they appear on the sky. Introductory 
references to supplement the following brief outline include 
the classic lectures of Baade (1963) and subsequent reviews 
by Morgan & Osterbrock (1969), King (1971, 1977), van den 
Bergh (1975), and Sandage et al. (1975). The Hubble Atlas of 
Galaxies (Sandage 1961a) is an essential source book for pho- 
tographs and descriptions of typical galaxies, while the Atlas 
of Peculiar Galaxies (Arp 1966) illustrates a great variety of 
unusual forms. 


Most galaxies can be arranged in a natural sequence of 
morphological types, the Hubble sequence. The “earliest” are 
elliptical (E) galaxies, with apparently smooth distributions 
of yellow-red stars and seldom any signs of internal structure 
(except that individual stars and globular clusters can be re- 
solved in the closest ones); and the “latest” are irregulars (Irr 
I) with no symmetry and with disorganized patches of blue 
stars, hot gas, and dust throughout¹. In between are the spirals,
in order from Sa types with large nuclear bulges and
tightly wound arms to Sc types with small bulges and open 
arms; the nuclear bulges closely resemble elliptical galaxies, 
while spiral arms, or more often bits of spiral arms, are traced 
by patchy blue stars and gas and by dark dust lanes, super- 
posed on an underlying smoother disk of redder stars. The 
galaxies classified as S0 have disks but lack spiral structure
and its associated young stars and gas; Hubble put them be- 
tween types E and Sa, but van den Bergh (1976b) pictures 
them as a sequence parallel to the spirals, ordered by bulge- 
to-disk ratio. Another “dimension” of the Hubble sequence is 
based on the presence or absence of a bar in the center, from 
the ends of which the spiral arms seem to emerge; there are 
barred spirals of types SBa with tight arms through SBc with 
loosely wound arms. The Hubble sequence has been elabo- 
rated by de Vaucouleurs (1959) into a more detailed system 
indicating intermediate forms and additional structural fea- 
tures. 

Although the sequence E-S0-Sa-Sb-Sc-Irr I was defined 
by the shapes of galaxies, it is very much a sequence of stel- 
lar populations. The earliest galaxies show no signs of hot 
young stars and (usually) have undetectably little gas or dust, 
while the latest galaxies are undergoing active star formation 
and have significant amounts of ISM. The Yerkes classifica- 
tion system of galaxies (Morgan & Mayall 1957; Morgan & 
Osterbrock 1969) emphasizes the correlation between forms 
and stellar populations of galaxies. For spirals, this system 
is based simply on the central concentration of light, and it 
is found that the highly concentrated galaxies have late-type 
nuclear spectra (indicating a mixture of G, K, and M stars), 
while those with little central concentration have early-type 
spectra (dominated by B stars) even in the nucleus. The 
Yerkes types of galaxies are correlated loosely with Hubble 
types, which is not surprising since the Hubble classification 
is based partly on the amount of light concentrated into a 
nuclear bulge and partly on the tightness of spiral structure. 


1 By a historical misfortune, the early-type galaxies are dominated
by late-type (red) stars, and the late-type galaxies are dominated 
by early-type (blue) stars! 




The increasing prevalence of blue, young stars as one goes 
from elliptical through irregular galaxies is reflected in a gen- 
eral progression from redder to bluer integrated colors. 

From studies of the integrated spectra and colors of galax- 
ies, it is concluded that nearly all galaxies have some very 
old stars, and that their different proportions of red and blue 
stars in the light are due mainly to different current rates of 
star formation. Most (if not all) galaxies appear to be many 
billions of years old, but whereas significant star formation 
ceased billions of years ago in ellipticals, it has occurred at a 
roughly constant rate in the latest types of galaxies. Theories 
of galaxy formation, and of star formation within galaxies, 
are challenged to explain this close correlation between the 
forms of galaxies and their histories of star formation. 

Some galaxies fit nowhere into a tidy sequence of types, and 
they are classified as simply “peculiar” or as one of the special 
types of peculiars such as ring galaxies or Irr II (which are 
early-type galaxies in form but strewn with dust, young stars 
etc.). Many of the peculiar systems are interacting galaxies, 
distorted by tidal effects (Toomre & Toomre 1972). Some- 
times the colors of peculiar galaxies are so blue that any 
underlying old population is completely outshone by newly- 
formed stars, and sometimes they are so red that most of 
the activity must be hidden in dust clouds. Although these 
very peculiar galaxies are rare, they are of special interest 
since some of them may be undergoing the kinds of violent 
changes and rapid star formation that characterized normal 
galaxies in their youth (Larson 1976b). 


The chemical compositions of galaxies in general show 
much less diversity than their stellar populations. Few in- 
stances are known of interstellar gas that is deficient or over- 
abundant in heavy elements by more than a factor of four rel- 
ative to the Sun, and although stellar metallicities range over 
more than two orders of magnitude, they are usually within 
a factor of three of Solar². Many of the differences among
chemical compositions in galaxies are systematic: metallici- 
ties of both stars and gas tend to decrease outward from the 
centers of galaxies, and average metallicities tend to increase 
with galaxy luminosity. 

Models for chemical evolution of galaxies aim to account 
for their compositions in terms of the production of elements 
by stars (mainly) and the mixing of stellar ejecta with inter- 
stellar gas. Gas flows often play important roles in chemical 
evolution, diluting the products of nucleosynthesis with unen- 
riched material from outside the galaxy, and carrying metals 
from one part of the galaxy to another. 

Many more details are known about the stellar population
in the Solar vicinity of our own galaxy than anywhere else,
and some of these details contain clues to large-scale
processes of galaxy formation. For example, stars with high space
velocities that make them members of the halo³ population
are metal-poor by factors of ten or more relative to the Sun,
while stars with disk motions have metallicities almost
entirely within a factor of three of Solar. Since the classic paper
of Eggen et al. (1962), this difference has been interpreted as
evidence that the halo stars formed first, before much
chemical enrichment by deaths of massive stars had taken place.
Those authors also used the kinematics of halo stars, and a
model for the collapse of the Galaxy from particular initial
conditions, to infer a free-fall timescale, ~$2 \times 10^8$ yr, for the
collapse and formation of halo stars. Ages that have recently
been determined for halo and disk stars now point toward a
slower collapse, however; globular clusters in the Galaxy
appear to have a range of ages from ~12 to 16 Gyr, while very
few disk stars near the Sun are older than 10 Gyr (Demarque
& McClure 1977; Saio 1977; Saio et al. 1977; Twarog 1980).
The picture presented by these values is of a rather slow
collapse, and a very long timescale for star formation to get
underway in the outer disk of the Galaxy. A very young star
cluster at least 10 kpc from the Sun in the Galactic anticenter
direction has recently been assigned a metallicity as low as
those of halo stars (Christian & Janes 1979), so it seems that
the outermost disk is still in a very early stage of chemical
evolution.


2 The term “metals” in this field generally includes all elements
heavier than helium. True metals, especially iron, are most easily
detected in the spectra of stars, while the common non-metallic
elements, especially oxygen, are most accessible in the ISM.
Another loose usage in this field is that the word “element” frequently
means a particular nuclide (e.g., the “element” $^{13}$C).

3 The word “halo” is used in the astronomical literature in two
ways: (i) its classical meaning, a spheroidal population of ordinary
stars, typified by globular clusters; and (ii) an invisible spheroidal
component that provides much more galactic mass than can be
ascribed to known stars and gas. The first meaning is implied
throughout this article, unless otherwise stated.


In general, composition differences among the old stars of
galaxies serve as frozen-in tracers of their early chemical
evolution, while abundances in young stars and the ISM
indicate the progress of continuing enrichment. These effects are
closely tied to the history of star formation in various
regions of galaxies, which relates them to factors determining
photometric properties. It will be seen later in this review
that the aspects of galactic evolution of greatest importance
for photometric evolution and for chemical evolution,
respectively, are to a large extent complementary, so the two fields
of study tend to yield different types of information about
the evolution of galaxies.


1.2 Galaxy Formation and Evolution


The striking correlations between the forms and contents of
galaxies, and systematic changes of content with position in
galaxies, point to the importance of dynamical processes in
chemical and photometric evolution. Many theoretical ideas
on the formation and dynamical evolution of galaxies have
been motivated by these correlations, and dynamical models
in turn have led to further understanding of such properties.
A brief sketch of some current theoretical views of galaxy
formation and dynamical evolution will therefore be given here.
A fuller non-technical introduction to the field is given by
Larson (1977c), and more technical reviews are by
Doroshkevich et al. (1978), Gott (1977), Jones (1976), Larson (1976a),
and Rees (1977).


In the conventional cosmological picture, the primeval gas
emerging from the Big Bang some 10-20 Gyr ago consisted
almost entirely of hydrogen and helium. There must have
been large enough density perturbations in this gas for some
regions to be locally bound and to collapse despite the expan- 
sion of the Universe, and the existence today of galaxies and 
bound clusters with a $10^{10}$-fold range of masses shows that
there was a very wide spread of density perturbations. A favored
mass scale is ~$10^6\,M_\odot$, which is the Jeans mass in the
Universe after (re-)combination of the primeval plasma about
$10^6$ yr after the Big Bang. Maybe star formation first began
in lumps of this mass, which could themselves have been part 
of larger-scale perturbations destined to collapse eventually 
into a massive galaxy. Although models for galaxy formation 
often start from an idealized smooth density distribution, gas 
in this state would be Jeans unstable on much smaller scales 
than the protogalaxy, and one would expect star formation 
to occur in regions of much higher density than average. 


Independently of cosmological details, a protogalaxy can 
be pictured as a lumpy gas cloud governed by self-gravity 
(and perhaps by perturbations due to neighbors) rather than 
by the expansion of the Universe. If no stars formed, the 
gaseous protogalaxy would collapse, collisions between gas 
clouds would dissipate the energy parallel to the net angu- 
lar momentum vector, and the system would become a flat 
disk. (The process of “dissipation” in this context usually im- 
plies that some of the kinetic energy of the gas is lost by 
collisionally induced radiation). If, on the other hand, stars 
form in much less than the collapse time, there would be no 
dissipation and a spheroidal system of stars would be pro- 
duced; subclustering due to the initially lumpy gas distribu- 
tion would be wiped out except in regions with about the 
mass and density of globular clusters. A spheroidal system 
can also be formed in much longer than the free-fall time if 
dissipation takes much longer, perhaps because the gas is in 
clouds that collide infrequently. 


In general, the formation of spheroidal stellar systems — 
elliptical galaxies and the bulges of spirals — requires that 
stars form on a timescale less than that of gaseous dissipa- 
tion, while the formation of a disk requires the opposite sit- 
uation. The amount of disk possessed by a galaxy therefore 
depends on the amount of gas that remained to collapse dissi- 
patively, after efficient star formation in the spheroidal com- 
ponent. This dynamical picture is consistent with the great 
ages, ~$10^{10}$ yr, inferred for stars in spheroidal systems, and the
contrasting range of ages (including ongoing star formation) 
in disks. 


In detail, the differences between spheroidal and disk sys- 
tems are not quite so clear-cut, because the formation of 
condensed nuclei in elliptical galaxies and the central bulges 
of spirals probably involved some dissipation. The enhanced 
metallicities of nuclear stars are thought to be due to an in- 
ward concentration of metals, as gas enriched by massive stars 
further out dissipated its energy and condensed toward the 
center. 


S0 galaxies appear to be intermediate systems, having disks 
but no young stars. One class of theories of their origin sug- 
gests that these properties are intrinsic, in that star formation 
efficiently used up the gas after it had collapsed to a disk. In 
another class of theories, S0 galaxies are former spirals that
are devoid of young stars because their ISM was swept away,
either in a collision with another galaxy or by an intergalactic
wind due to motion of the galaxy through intergalactic gas.
The high frequency of S0 galaxies in dense clusters, where
intergalactic gas has been detected, suggests that at least some
were formed in the latter way, but other S0 galaxies are quite
isolated and may be of the “intrinsic” type.


Figure 1. A schematic outline of the links between various
constituents of galaxies and evolutionary processes. Solid lines
indicate the main factors in chemical evolution, and dashed lines
are for photometric evolution.

Sweeping by collisions and intracluster gas are examples 
of interactions with the environment that probably affect the 
evolution of many galaxies, including apparently normal ones 
and very peculiar objects. Other effects include mergers of 
galaxies in the centers of clusters and in close binaries, tidal 
disruption in close passages, and bursts of star formation due 
to such violent interactions among gas-rich galaxies. A spi- 
ral galaxy may also accrete gas from its surroundings during 
most of its life, gradually forming a more massive and more 
extended disk and acquiring new gas to fuel continued star 
formation. Various aspects of interactions between galaxies 
and their environment are reviewed by Ostriker (1977b), Saar 
& Einasto (1978), and Toomre (1977). 


1.3 Plan of this Review 


From the preceding outline, it should be clear that attempts 
to understand the evolution of stars and gas in galaxies in- 
evitably get involved in very diverse aspects of astronomical 
theory and observation. This is not a field in which one can 
hope to develop a complete theory from a simple set of as- 
sumptions, because many relevant data are unavailable or 
ambiguous, and because galactic evolution depends on many 
complicated dynamical, atomic, and nuclear processes which 
themselves are incompletely understood. Because a logical 
development is inappropriate, this article treats different as- 
pects of galactic evolution as pieces of a jigsaw puzzle that 
may some day be put together. 

Figure 1 is a very schematic diagram of relations among 
some of the most important factors in galactic evolution. It 
serves to summarize parts of the preceding outline, to suggest 
how a final jigsaw might be linked, and to show how the topics 
discussed below are related to each other. Two main sets of 
processes and constituents are indicated. 


(i) Chemical evolution depends mainly on the areas connected
by solid lines, which indicate that stars form from the
ISM and return some of their mass, including new products 
of nucleosynthesis, to the ISM at death; the composition of 
the ISM also depends on gas flows to and from the region 
under consideration. 


(ii) Photometric properties depend mainly on the areas
connected by dashed lines, indicating that the history
of star formation and the evolution of individual stars
determine the population at any time. 


Sections 3, 4, and 5 below are on chemical evolution and 
Sections 6 and 7 are on the evolution of photometric proper- 
ties. Since the formation and evolution of stars are vital to 
all aspects of galactic evolution, these processes are reviewed 
first in Section 2. 


2 THE FORMATION AND EVOLUTION OF STARS 


The evolution of galaxies depends critically on properties of 
stars as a function of mass: their formation rates, evolution 
in the Hertzsprung-Russell (HR) diagram, lifetimes, and the
mass and composition of material returned to the ISM. This 
Section reviews the properties of stars that are most relevant 
to later discussion of galaxy evolution. 


2.1 Basic Physical Properties of Stars 


Table 1 lists for reference some properties of main sequence 
(MS) stars with chemical compositions typical of the nearby 
disk population, i.e. with Solar or slightly lower metallicities. 
Because stars evolve in the HR diagram while still on the 
MS, this Table only applies to a particular stage of evolution 
for each mass: stars with total MS lifetimes less than 10 Gyr 
(masses $\gtrsim 1\,M_\odot$) are listed with the properties they have in
the middle of this lifetime; less massive stars are listed at 
age 5 Gyr or with mean empirical properties. The theoreti- 
cal properties of MS stars of a given mass and composition 
differ slightly in different series of calculations, and they are 
sensitive to the helium and heavy-element abundances, even 
within the range of values applicable to disk stars, so the tab- 
ulated values should not be accepted too literally. This Table 
is intended only to provide approximate relations between 
such quantities as spectral type and MS lifetime, for quick 
reference. 

An overview of stellar evolution in the HR diagram is pro- 
vided by Figures 2 and 3. Figure 2 is a theoretical HR dia- 
gram with the zero-age MS and evolutionary tracks for stars 
of a few masses; the tracks are drawn lightly in regions where 
stellar evolution is rapid (relative to neighboring points on 
the same track) since few stars are observed at those stages. 
A given star spends only about 10% of its MS lifetime in 
stages of evolution beyond the MS, so the supergiant and gi- 
ant regions of the HR diagram contain a relatively ephemeral 
population of stars. Much the same information is given em- 
pirically in Figure 3, which is a reproduction of the compos- 
ite color-magnitude diagram for open clusters of Sandage & 
Eggen (1969). The clusters have too few stars for the faster 
stages of evolution to be well represented; not only are there 
gaps between the MS and the giant branch, but the latest 
giants are absent from all but one or two clusters. Despite
this problem, the diagram of Sandage & Eggen (1969) gives
a vivid picture of stellar evolution.


Table 1. Representative data for main sequence stars. Quantities
tabulated are typical values for stars of approximately Solar
composition, from a variety of sources including mainly the
following. Masses m < 0.6 $M_\odot$: data from Veeder (1974);
0.8 < m < 10 $M_\odot$: theoretical tracks of Mengel et al. (1979)
and Alcock & Paczynski (1978); m > 20 $M_\odot$: theoretical tracks
of Chiosi et al. (1978) with maximum mass loss rates. For
m < 0.6 $M_\odot$, the mean empirical main sequence is given; for
0.8 < m < 1.0 $M_\odot$, points are for an age of 5 Gyr; and for more
massive stars, points are for the middle of the main sequence
lifetime ($\tau_{\rm ms}/2$).


$m$ ($M_\odot$)  $\tau_{\rm ms}$ (Gyr)  $\log(L/L_\odot)$  $M_V$  $\log T_{\rm eff}$  $B-V$  Sp
0.15             -                      -2.5               14.2   3.48                1.80   M7
0.25             -                      -2.0               12.0   3.52                1.60   M5
0.4              -                      -1.4               10.0   3.57                1.48   M1
0.6              -                      -0.9               7.6    3.64                1.18   K5
0.8              25                     -0.4               6.0    3.70                0.88   K1
0.9              15                     -0.2               5.4    3.73                0.76   G8
1.0              10                     0.0                4.9    3.76                0.64   G2
1.1              6.4                    0.2                4.3    3.79                0.56   F8
1.2              4.5                    0.4                3.7    3.82                0.47   F6
1.3              3.2                    0.5                3.5    3.84                0.42   F5
1.4              2.5                    0.7                3.0    3.86                0.36   F2
1.5              2.0                    0.8                2.8    3.88                0.30   F0
2                0.75                   1.3                1.4    3.98                0.00   A0
3                0.25                   2.1                -0.2   4.10                -0.12  B7
4                0.12                   2.6                -0.6   4.18                -0.17  B5
6                0.05                   3.2                -1.5   4.30                -0.22  B3
8                0.03                   3.6                -2.2   4.35                -0.25  B1
10               0.02                   3.9                -2.7   4.40                -0.27  B0.5
15               0.01                   4.4                -3.7   4.45                -0.29  B0.5
20               0.008                  4.7                -4.3   4.48                -0.30  B0
30               0.006                  5.1                -5.1   4.51                -0.31  O9.5
40               0.004                  5.4                -5.7   4.53                -0.31  O9
60               0.003                  5.7                -6.2   4.58                -0.32  O5


Using these Figures, one can envisage the evolution of a 
single generation of stars in a galaxy, starting with their pop- 
ulations on the zero-age MS. As time passes, the top of the 
MS peels away and short-lived supergiants appear; the MS 
turnoff evolves down to mid-B spectral type in $10^8$ yr, and
by $10^9$ yr late A and later stars remain on the MS; between
$10^9$ and $10^{10}$ yr, the turnoff evolves from late A to early G,
and there is a population of evolving K and M giants. An 
elliptical galaxy, for example, that had no star formation for 
the last many billions of years, would be expected to consist 
only of K and M giants and the MS up to a late F or early 
G turnoff. In a galaxy like our own, where star formation has 
presumably been continuous for billions of years, the whole 
array of MS stars, turnoff, and giant branches can be found. 
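
The sequence just described can be made quantitative with the lifetimes in Table 1. The following Python sketch is only an illustration: it interpolates the tabulated main sequence lifetimes to estimate the turnoff mass of a single generation of stars at a given age. The function name `turnoff_mass` and the choice of linear interpolation in (log lifetime, log mass) are assumptions made here for convenience, not part of the original discussion.

```python
import math

# (mass in solar masses, main sequence lifetime in Gyr), copied from the
# rows of Table 1 that list a lifetime.
TABLE1 = [
    (0.8, 25.0), (0.9, 15.0), (1.0, 10.0), (1.1, 6.4), (1.2, 4.5),
    (1.3, 3.2), (1.4, 2.5), (1.5, 2.0), (2.0, 0.75), (3.0, 0.25),
    (4.0, 0.12), (6.0, 0.05), (8.0, 0.03), (10.0, 0.02), (15.0, 0.01),
    (20.0, 0.008), (30.0, 0.006), (40.0, 0.004), (60.0, 0.003),
]


def turnoff_mass(age_gyr):
    """Mass (solar units) whose tabulated MS lifetime equals age_gyr.

    Interpolates linearly in (log lifetime, log mass) between adjacent
    rows; ages outside the tabulated range raise ValueError.
    """
    if not (TABLE1[-1][1] <= age_gyr <= TABLE1[0][1]):
        raise ValueError("age outside the range covered by Table 1")
    for (m_lo, t_lo), (m_hi, t_hi) in zip(TABLE1, TABLE1[1:]):
        # Lifetimes decrease with mass, so t_hi <= age <= t_lo brackets it.
        if t_hi <= age_gyr <= t_lo:
            frac = (math.log10(t_lo) - math.log10(age_gyr)) / (
                math.log10(t_lo) - math.log10(t_hi))
            log_m = math.log10(m_lo) + frac * (
                math.log10(m_hi) - math.log10(m_lo))
            return 10.0 ** log_m
    raise AssertionError("unreachable: the table brackets every valid age")


for age in (0.1, 1.0, 10.0):
    print(f"age {age:5.1f} Gyr -> turnoff near {turnoff_mass(age):.2f} Msun")
```

At 10 Gyr the interpolation returns the tabulated 1.0 solar mass point, and ages near $10^8$ yr fall among the mid-B masses, in line with the description above. The tabulated lifetimes are themselves approximate, so the output should be read with the same caution as Table 1.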


2.2 The Initial Mass Function 


Because the properties of stars depend strongly on their 
masses, the distribution of stellar masses at birth is a function 
vital to the photometric and chemical evolution of galaxies. 
Various definitions and notations for this function are used 
in the literature, so care is needed to avoid confusion in com- 
paring different papers. The following notation is used here: 
the number of stars formed in the mass interval (m, m + dm)
and in the time interval (t, t + dt) is

$$\phi(m)\,\psi(t)\,dm\,dt, \tag{2.1}$$

where $\psi(t)$ is the total mass of stars formed per unit time,
and $\phi(m)$, which may itself be a time-dependent function, is
therefore normalized so that

$$\int_0^\infty m\,\phi(m)\,dm = 1. \tag{2.2}$$

The function $\psi(t)$ is called the star formation rate (SFR)
and $\phi(m)$ is called the initial mass function (IMF). It is often
useful to approximate $\phi(m)$ by a power law, at least over some
range of masses, and in these cases the notation used here is

$$\phi(m) \propto m^{-(1+x)}, \tag{2.3}$$

where x is called the slope of the IMF⁴.


Figure 2. Theoretical HR diagram indicating main sequence
lifetimes and evolutionary tracks of stars with approximately Solar
composition. The zero-age main sequence (ZAMS) is shown, with
a few stellar masses ($M_\odot$) and lifetimes (yr). Also shown are
representative evolutionary tracks, shaded heavily where a star evolves
slowly relative to adjacent points. Sources: ZAMS and tracks for
m < 3 $M_\odot$: Mengel et al. (1979), Z = 0.01, Y = 0.20; ZAMS for
3 < m < 10 $M_\odot$: Alcock & Paczynski (1978), Z = 0.01, Y = 0.29; track
for m = 4.5 $M_\odot$: Harris & Deupree (1976), Z = 0.01, Y = 0.28; ZAMS
and tracks for m > 20 $M_\odot$: Chiosi et al. (1978), case with maximum
mass loss rate, Z = 0.02, Y = 0.28.

A comprehensive review of the IMF in the Solar neigh- 
borhood and elsewhere has recently been written by Scalo 
(1978), so the following account includes only those aspects 
that are most relevant to the rest of this article. 
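
As a concrete illustration of the normalization in Equation (2.2), the following Python sketch builds a single power-law IMF and checks numerically that the mass integral is unity. The slope x = 1.35 and the mass limits of 0.1 and 60 solar masses are assumptions chosen only for the example, and the helper names `powerlaw_imf` and `integrate_log` are hypothetical.

```python
import math


def powerlaw_imf(x, m_lo, m_hi):
    """Return phi(m) = A * m**-(1 + x) on [m_lo, m_hi], zero elsewhere,
    with A fixed by the mass normalization of Equation (2.2)."""
    if abs(x - 1.0) < 1e-12:
        mass_integral = math.log(m_hi / m_lo)
    else:
        mass_integral = (m_hi ** (1.0 - x) - m_lo ** (1.0 - x)) / (1.0 - x)
    amplitude = 1.0 / mass_integral

    def phi(m):
        return amplitude * m ** (-(1.0 + x)) if m_lo <= m <= m_hi else 0.0

    return phi


def integrate_log(func, a, b, steps=20000):
    """Midpoint rule in ln m: integral of f dm = integral of f * m dln m."""
    h = (math.log(b) - math.log(a)) / steps
    total = 0.0
    for i in range(steps):
        m = math.exp(math.log(a) + (i + 0.5) * h)
        total += func(m) * m * h
    return total


phi = powerlaw_imf(x=1.35, m_lo=0.1, m_hi=60.0)
mass_check = integrate_log(lambda m: m * phi(m), 0.1, 60.0)
stars_per_msun = integrate_log(phi, 0.1, 60.0)
mass_fraction_above_8 = integrate_log(lambda m: m * phi(m), 8.0, 60.0)
print(f"integral of m*phi dm        = {mass_check:.4f}")
print(f"stars formed per solar mass = {stars_per_msun:.2f}")
print(f"mass fraction above 8 Msun  = {mass_fraction_above_8:.3f}")
```

Because the normalization is by mass, the integral of the IMF itself gives the number of stars formed per unit mass of stars formed. Both this number and the high-mass fraction depend strongly on the adopted limits and slope, so the printed values are illustrative only.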


4 Quantities equivalent to x, -x, (1+x), and -(1+x) have been
called “the slope of the IMF” by different authors. 


Figure 3. Composite color-magnitude diagram for open clusters 
(Sandage & Eggen 1969). Ages at the right refer to the blue point 
for a given cluster. Modern distances and age scales correspond to 
younger ages, e.g., $5 \times 10^9$ yr for NGC 188 (Demarque & McClure
1977). This diagram is an empirical counterpart of Figure 2. 


2.2.1 The local IMF 


The IMF in the Solar neighborhood can be derived from 
counts of field stars, using principles originally derived 
by Salpeter (1955), and generalized to allow for a time- 
dependent SFR by Schmidt (1959, 1963), among others. A 
recent thorough rediscussion of the data and methods is given 
by Miller & Scalo (1979). 

Stars are counted as a function of absolute magnitude, and 
the counts must first be reduced to the present mass distri- 
bution of stars, n(m), where n(m) dm will denote the present 
number of MS stars in the mass interval (m, m+dm). Several 
non-trivial problems arise at this stage, including the follow- 
ing: 


(i) The number of stars in each absolute magnitude in- 
terval must be corrected for post-MS stars, which requires a 
knowledge of color or spectral type; these corrections can be 
large; e.g., near $M_V = +1$ there are about equal numbers per
unit volume of A dwarfs and K giants. 


(ii) Field stars of different masses have very different distri- 
butions perpendicular to the Galactic plane, the scaleheight 
decreasing with increasing mass. The function n(m) obtained 
for stars in a unit volume therefore differs systematically from 
the function for a unit column perpendicular to the plane. 
Because one is normally interested in the whole population 
of stars formed at all heights (or formed in the plane and 
accelerated to more extended orbits), near the Sun’s galac- 
tocentric distance, the star counts should be reduced to the 
average number per square parsec at this distance from the 
Galactic center. 


(iii) The transformation between magnitudes and masses 
of MS stars depends on their chemical composition. 


(iv) Stars evolve in the HR diagram while on the MS, so
there is not a unique mass-luminosity relation even for a
given composition.


The papers cited above discuss how these problems can be 
handled, and the resulting uncertainties in n(m). 
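
Problem (ii) above can be illustrated with a small Python sketch. It assumes, purely for illustration, that each group of stars has an exponential vertical distribution n(z) = n0 exp(-|z|/H), for which the number per square parsec through the whole disk is 2 H n0. The densities and scaleheights below are hypothetical round numbers, not values from the papers cited.

```python
def column_density(n0_per_pc3, scale_height_pc):
    """Stars per pc^2 through the whole disk for n(z) = n0 * exp(-|z| / H).

    Integrating over both sides of the plane gives 2 * H * n0.
    """
    return 2.0 * scale_height_pc * n0_per_pc3


# Hypothetical numbers, chosen only to show the size of the effect:
# two mass bins with equal midplane densities but different scaleheights.
bins = {
    "low-mass dwarfs": {"n0": 1.0e-3, "H": 300.0},
    "massive stars": {"n0": 1.0e-3, "H": 100.0},
}
for name, b in bins.items():
    print(name, column_density(b["n0"], b["H"]), "stars per pc^2")
```

With these assumed values, two groups that are equally common per unit volume in the plane differ by a factor of three per unit column, which is why the choice between volume and column counts changes the shape of n(m).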

Let us assume that we know the present mass distribution
of MS stars per square parsec in the Solar neighborhood,
n(m). This function can be used to estimate the IMF and SFR
as follows. Stars with lifetimes ($\tau_m$) greater than the age of
the Galaxy ($t_1$) have accumulated since star formation began
(t = 0), so Equation (2.1) gives directly

$$n(m) = \int_0^{t_1} \phi(m)\,\psi(t)\,dt, \qquad \tau_m \ge t_1. \tag{2.4}$$

If the IMF is assumed constant, this expression simplifies to

$$n(m) = \phi(m)\,\bar\psi_1\,t_1, \qquad \tau_m \ge t_1, \tag{2.5}$$

where $\bar\psi_1$ is the average past SFR; Equation (2.5) can be
used in any case if $\phi(m)$ is interpreted as the average past
IMF. The mass with $\tau_m = t_1$ is called the present turnoff mass
($m_1$) since it defines the lowest MS turnoff point in the HR
diagram for local field stars. In practice, one must consider
how the mass-lifetime relation depends on metallicity, but
to illustrate the principles here these effects will be ignored;
approximate round numbers are $m_1 \sim 1\,M_\odot$, $t_1 \sim 10$ Gyr.

The present population of stars with $\tau_m < t_1$ includes only
those that were formed at times less than $\tau_m$ ago, so from
Equation (2.1) their mass distribution is

$$n(m) = \int_{t_1-\tau_m}^{t_1} \phi(m)\,\psi(t)\,dt, \qquad \tau_m < t_1. \tag{2.6}$$

If $\phi(m)$ is constant, it can be taken out of the integral, but n(m)
still depends on details of the past SFR. A simpler equation
can be written for stars with lifetimes shorter than the present
timescale for changes in $\psi(t)$:

$$n(m) = \phi(m)\,\psi_1\,\tau_m, \qquad \tau_m \ll t_1, \tag{2.7}$$

where $\psi_1$ is the present SFR. The function n(m) is supposed to
be averaged over the patchy distribution of the youngest stars
in the Solar neighborhood, so Equation (2.7) is probably a
good approximation for $m \gtrsim 2\,M_\odot$, i.e., for stars with lifetimes
$\lesssim 1$ Gyr and MS spectral types earlier than about A0; it would
hold for all $m > m_1$ if the SFR were constant.
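
The relation between the general expression (2.6) and the short-lifetime limit (2.7) can be checked numerically. The Python sketch below assumes a constant IMF and, purely as an illustration, an exponentially declining SFR with an e-folding time of 5 Gyr and a Galactic age of 10 Gyr; the function names and the SFR history are hypothetical. It evaluates n(m)/phi(m), the integral of the SFR over the last min(lifetime, age) Gyr, and compares it with the present SFR multiplied by the lifetime.

```python
import math

T1_GYR = 10.0      # round-number age of the Galaxy used in the text
DECAY_GYR = 5.0    # hypothetical e-folding time of the SFR


def sfr(t):
    """Hypothetical SFR history psi(t), arbitrary units, t in Gyr since t = 0."""
    return math.exp(-t / DECAY_GYR)


def stars_surviving_per_phi(lifetime_gyr, steps=10000):
    """n(m) / phi(m): midpoint-rule integral of psi over the interval
    from max(0, t1 - lifetime) to t1, covering Equations (2.4) and (2.6)."""
    start = max(0.0, T1_GYR - lifetime_gyr)
    h = (T1_GYR - start) / steps
    return sum(sfr(start + (i + 0.5) * h) for i in range(steps)) * h


psi_now = sfr(T1_GYR)
for lifetime in (0.03, 0.75, 4.5, 25.0):
    exact = stars_surviving_per_phi(lifetime)
    short_lived = psi_now * lifetime  # Equation (2.7)
    print(f"lifetime {lifetime:5.2f} Gyr: integral {exact:.4f}, "
          f"present SFR x lifetime {short_lived:.4f}")
```

For the shortest lifetimes the two columns agree closely, while for lifetimes comparable to the age of the Galaxy the product of present SFR and lifetime no longer matches the integral, which is the reason the intermediate masses need separate treatment.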

From the empirical function n(m), therefore, one can use
Equation (2.5) to derive the shape of the (average) IMF for
stars below about $1\,M_\odot$, and Equation (2.7) to derive the shape
of the IMF for stars above about $2\,M_\odot$. These two pieces
of the IMF are not determined with the same multiplicative
factors, since the quantities actually determined are $\phi(m)\,\bar\psi_1 t_1$
and $\phi(m)\,\psi_1$, respectively. In particular, the relative values of
the IMF for $1\,M_\odot$ and $2\,M_\odot$ depend on the ratio

$$T_1 = \frac{\bar\psi_1 t_1}{\psi_1}, \tag{2.8}$$

which is evidently a timescale for star formation in the Solar
neighborhood.
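
The way this ratio enters can be written out explicitly. The short derivation below is a sketch that only combines Equations (2.5), (2.7), and (2.8), with $\bar\psi_1$ the average past SFR, $\psi_1$ the present SFR, and $\tau_m$ the MS lifetime; the labels $m_a$ and $m_b$ for a representative long-lived and short-lived star are introduced here for illustration.

```latex
% m_a: a mass with \tau_{m_a} \ge t_1;  m_b: a mass with \tau_{m_b} \ll t_1
\phi(m_a) = \frac{n(m_a)}{\bar\psi_1\, t_1}, \qquad
\phi(m_b) = \frac{n(m_b)}{\psi_1\, \tau_{m_b}}
\qquad\Longrightarrow\qquad
\frac{\phi(m_b)}{\phi(m_a)}
  = \frac{n(m_b)}{n(m_a)}\,\frac{\bar\psi_1\, t_1}{\psi_1\, \tau_{m_b}}
  = \frac{n(m_b)}{n(m_a)}\,\frac{T_1}{\tau_{m_b}} .
```

For a constant SFR the average and present rates are equal, so $T_1$ reduces to the age $t_1$; a declining SFR gives $T_1 > t_1$ and an increasing one gives $T_1 < t_1$.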


There is an intermediate mass interval, 1 < m/Mo < 2, for 
which Equation (2.6) cannot be simplified, so the shape of the 
IMF cannot be derived independently of details of y(t). This 
gap is usually filled by assuming that the functions for higher 
and lower masses can be smoothly interpolated, as illustrated, 


log (Ymg) 


log m 


Figure 4. The initial mass function in the Solar neighborhood, 
shown in the form log[jm¢(m)| versus log m, where (m) is the 
IMF itself, normalized as in Equation (2.2), and yw is the present 
star formation rate in Mo pe~* Gyr7!. Masses are in solar units. 
Solid line and shaded area: range of functions derived by Miller 
& Scalo (1979), allowing for uncertainties in the basic data and 
in the history of the star formation rate. Dashed line: power-law 
segments, given by Equation (2.9); the slopes of these lines are -x 
in the notation of Equation (2.3). (The dashed line and the shaded 
area are discrepant for —0.4 < log m < 0.0 because different stellar 
luminosity functions were used). 


for example, by Schmidt (1963, Figure 1). In this way, $\phi(m)$
for the intermediate mass range is determined without too
much ambiguity and constraints are set on the quantity $T_1$.


The IMF derived along the preceding lines by Miller & 
Scalo (1979) is illustrated in Figure 4, where the quantity 
plotted (logarithmically) is $m\phi(m)\psi_1$. The solid line is their
analytical relation for the case of a constant SFR and a
Galactic age $t_1 = 12$ Gyr: $m\phi(m)\psi_1 = 3.83 \exp[-1.09(\log m + 1.02)^2]\ \mathrm{pc}^{-2}\,\mathrm{Gyr}^{-1}$
(with $m$ in solar units). The shaded area
shows the range of uncertainty allowed by their analytic fits 
to limiting cases, based on uncertainties in the basic data and 
the limits for smoothly joining the low- and high-mass ends, 
as explained above. Miller & Scalo (1979) stress that, if one 
is making models for the Solar neighborhood, the choice of 
an IMF within the shaded area should be made consistently 
with the adopted SFR function, for otherwise the two func- 
tions together would not reproduce the observed star counts. 
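
The analytical relation quoted above can be evaluated directly. The following is a minimal sketch, assuming the exponent is read as $-1.09(\log m + 1.02)^2$ with a base-10 logarithm; the function names are illustrative. It also computes the local slope $x = -d\log[m\phi(m)\psi_1]/d\log m$, which grows with mass for this form.

```python
import math


def ms79_m_phi_psi(m, c0=3.83, c1=1.09, c2=-1.02):
    """m*phi(m)*psi_1 in pc^-2 Gyr^-1 for mass m in solar units.

    Coefficients are those quoted for a constant SFR and a 12 Gyr age.
    """
    return c0 * math.exp(-c1 * (math.log10(m) - c2) ** 2)


def local_slope(m, dlog=1e-4):
    """Local index x = -d log10[m phi psi_1] / d log10 m (central difference)."""
    log_m = math.log10(m)
    lo = math.log10(ms79_m_phi_psi(10 ** (log_m - dlog)))
    hi = math.log10(ms79_m_phi_psi(10 ** (log_m + dlog)))
    return -(hi - lo) / (2 * dlog)


value_at_1 = ms79_m_phi_psi(1.0)
slope_at_1 = local_slope(1.0)
slope_at_10 = local_slope(10.0)
```

Under these assumptions the slope is close to 1 at one solar mass and close to 2 at ten solar masses, which illustrates why a single power law is a poor description of the whole mass range.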

Power laws are very tractable analytically, so it is tempting 
to fit straight lines to the IMF derived from the star counts. 
Salpeter (1955) originally derived a famous slope x = 1.35 for 
the local IMF, but subsequent star counts and stellar lifetimes 
show that a single power law for the whole mass spectrum is 
a poor approximation; serious systematic errors can be intro- 
duced in some applications by ignoring the increase of slope 
with mass in the local IMF. The four dashed straight lines 
in Figure 4 are intended as a compromise between the data 
and analytical convenience. (The discrepancy with the shaded 
area between 0.4 and $1\,M_\odot$ arises because the dashed line for
those masses is based on the luminosity function of Wielen
(1974), which is flatter than the function adopted by Miller &
Scalo 1979, Figure 1). The equations of these four lines, which
have slopes $-x$ in the logarithmic plot (see Equation 2.3), are:


$$m\phi(m)\psi_1 = \begin{cases}
1.00\,m^{-0.25}, & 0.4 < m < 1.0, \\
1.00\,m^{-1.0}, & 1.0 < m < 2.0, \\
1.23\,m^{-1.3}, & 2.0 < m < 10.0, \\
12.3\,m^{-2.3}, & 10.0 < m < 50.0,
\end{cases} \qquad (2.9)$$


in units of $\mathrm{pc}^{-2}\,\mathrm{Gyr}^{-1}$, with masses in solar units. (It is
fortuitous that the coefficient 1.00 occurs in these equations).
A slope is not given for $m < 0.4\,M_\odot$ because M dwarfs contribute
little to the light of any galaxy and nothing directly
to chemical evolution; their contribution to the total mass
can be included with the invisible objects below $0.1\,M_\odot$.
Integrating Equation (2.9) with respect to mass, we find
that the present SFR for masses 0.4–50 $M_\odot$ is $3.0\,M_\odot\,\mathrm{pc}^{-2}\,\mathrm{Gyr}^{-1}$.
The total SFR, $\psi_1$, could be derived if we knew the contribution
from objects below $0.4\,M_\odot$, and $\psi_1$ in turn would allow
the IMF to be normalized as in Equation (2.2). As described
in Section 2.3.1, Miller & Scalo (1979) find $\psi_1$ in this way by
defining as “stars” only objects above $0.1\,M_\odot$, for which there
are star counts giving the shape of the IMF. Alternatively,
$\psi_1$ can be estimated indirectly (see Section 2.3.1). The values
found are within a factor of ~2 of $10\,M_\odot\,\mathrm{pc}^{-2}\,\mathrm{Gyr}^{-1}$, which is
several times greater than the rate for objects above $0.4\,M_\odot$.
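
The integration of Equation (2.9) can be checked with a short sketch. It assumes the four segments have slopes $x = 0.25$, 1.0, 1.3, and 2.3 (the first exponent is the least certain reading), and it integrates $m\phi(m)\psi_1$ over mass analytically, segment by segment. The names used are illustrative.

```python
import math

# (coefficient, slope x, lower mass, upper mass), masses in solar units.
# The first slope, 0.25, is an assumed reading of Equation (2.9).
SEGMENTS = [
    (1.00, 0.25, 0.4, 1.0),
    (1.00, 1.0, 1.0, 2.0),
    (1.23, 1.3, 2.0, 10.0),
    (12.3, 2.3, 10.0, 50.0),
]


def segment_integral(coeff, x, m_lo, m_hi):
    """Integral of coeff * m**(-x) dm from m_lo to m_hi."""
    if abs(x - 1.0) < 1e-12:
        return coeff * math.log(m_hi / m_lo)
    p = 1.0 - x
    return coeff * (m_hi ** p - m_lo ** p) / p


def sfr_above_0p4():
    """Present SFR for 0.4-50 solar masses, in Msun pc^-2 Gyr^-1."""
    return sum(segment_integral(*seg) for seg in SEGMENTS)


def mismatch_at_joins():
    """Relative mismatch of adjacent segments at each shared boundary."""
    out = []
    for (c1, x1, _, m_join), (c2, x2, _, _) in zip(SEGMENTS, SEGMENTS[1:]):
        left, right = c1 * m_join ** (-x1), c2 * m_join ** (-x2)
        out.append(abs(left - right) / left)
    return out


total = sfr_above_0p4()
```

With these slopes the segments join continuously to within about 0.1%, and the total agrees with the value of 3.0 quoted in the text to two significant figures.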


2.2.2 The IMF at other times and places 


Several questions can be asked about the constancy of the 
IMF. Is the IMF the same in regions other than the Solar 
neighborhood? Is it constant in time, locally and elsewhere? 
If variations occur, do they involve the shape of the IMF for 
stars above say 0.4 or $1\,M_\odot$, or do they involve only the mass
fraction in “inert” low-mass objects? 


Star clusters in the Galaxy and the Magellanic Clouds have 
a variety of MS luminosity functions indicating significant 
variations in the IMF on the scale of clusters (Da Costa 1977; 
Freeman 1977; others reviewed by Scalo 1978); also, there are 
regions of the Milky Way where the only newborn stars seem 
to be T Tauri stars, with masses $< 2\,M_\odot$, and other regions
abounding in young OB stars. These observations, however, 
do not necessarily imply variations on galaxy-wide scales in 
averages over many sites of star formation. In fact, counts of 
field stars in nearby galaxies have not revealed any significant 
deviations from the local IMF (Butcher 1977; Hardy 1977; 
Lequeux 1979). The clearest evidence for large-scale varia- 
tions is the mass function for nearby halo stars of Schmidt 
(1975); this function has $x = 2$ for 0.25–0.75 $M_\odot$, whereas local
disk stars have x < 1 in the same mass range. There are also 
some early-type spiral galaxies with widespread patches of 
young blue stars but a deficiency of H II regions, suggesting
that the IMF is deficient in O stars; examples are the Som- 
brero galaxy, M104 (van den Bergh 1976a; Schweizer 1978), 
and NGC 2841 (Kormendy 1977). 

The upper part of the IMF ($m > 0.4\,M_\odot$) can be studied
indirectly by comparing colors of galaxies with those of models
based on various IMFs. As discussed in Section 7, the
results are consistent with the local IMF holding (on large
scales) everywhere, but in some respects the test is rather
insensitive. For example, a galaxy like the Sombrero is so
dominated by old yellow-red stars that the integrated colors
at optical wavelengths would be changed negligibly by the
presence or absence of stars above $10\,M_\odot$ in the IMF. The
red-infrared spectra of galaxies show that giant stars, rather
than late dwarfs, provide most of the light at these wavelengths,
and this result sets some constraints on the IMF for
stars of ~0.4–1 $M_\odot$: if the IMF were much steeper than the local
function (say if $x > 2$), low-mass dwarfs would contribute
much of the infrared light, contrary to the spectroscopic
information.

Because of this giant dominance in the infrared light, photometric
observations are very insensitive to the mass fraction
in stars below $0.4\,M_\odot$, i.e. M dwarfs; these stars could
be numerous enough to dominate the mass of the system 
while contributing very little light! A handle on the mass 
fraction in invisible stars is given by the mass-to-luminosity 
(M/L) ratio, which is crudely a ratio of very low-mass to more 
massive stars. In the Solar neighborhood, the integrated lu- 
minosity of known stars (per pc*), including giants, divided 
by the dynamically-estimated density of mass, yields a ratio
$M/L_B = 3\,M_\odot/L_{B\odot}$ (Faber & Gallagher 1979). If the IMF
and proportion of invisible matter were invariant, all galaxies 
should have this M/L ratio, apart from a systematic increase 
by a factor of ~10 from the bluest galaxies to the reddest 
(because of the decreasing fraction of young blue stars), and 
apart from a scatter ~10% due to different mass fractions of 
ISM. In fact, observed M/L values show a scatter by a factor 
of 10 or more at a given color, and a tendency to increase 
from values of a few in the central parts of galaxies to ~100 
in the outer regions (Faber & Gallagher 1979). It appears 
that different proportions of hidden matter exist in different 
galaxies, and that the invisible mass fraction increases with 
radius. If this matter condensed before ordinary star forma- 
tion began in galaxies, as in some pictures of the formation 
of heavy invisible halos (e.g. White & Rees 1978), then it 
would have no direct effect on chemical or photometric evo- 
lution. (It would have indirect effects, because the motions of 
gas and stars in the galaxy would be influenced by the hid- 
den mass). However, if there are variations in the low-mass 
part of the stellar IMF, chemical evolution would be affected 
significantly, because variable proportions of stellar matter 
would be returned to the ISM by evolving stars. 

Certain chemical properties of galaxies can be explained 
by invoking variations in the shape of the IMF for massive 
stars. For example, the extreme paucity of metal-poor dwarfs 
in the Solar neighborhood could be due to an initial burst 
of massive metal-producing stars; and variations in relative 
abundances of heavy elements could be due to variable pro- 
portions of the stars that synthesize such elements. But in all 
cases, other explanations not involving variations in the IMF 
are available, as discussed in Sections 4 and 5. 


In summary, significant variations in the IMF may be rare 
enough for the assumption of a universal function (the local 
IMF) to be useful for many contexts. However, it is equally 
relevant to ask how galactic evolution would be affected by 
changes in both the form of the IMF for visible stars and the 
relative weight of very low-mass objects. 


2.3 Rates of Star Formation 


The rate of star formation is one of the main factors in 
galactic evolution. The shape of a galaxy depends on the
timescale for star formation relative to collapse and dissipation
timescales, the colors and luminosity depend on the age
distribution of stars, and chemical evolution depends on the 
SFR relative to the gas flow rates and the gas mass. Various 
ways of estimating SFRs in galaxies are reviewed here. 


2.3.1 The local SFR 


There are several ways of estimating the present SFR in the 
Solar neighborhood, which is the quantity in $M_\odot\,\mathrm{pc}^{-2}\,\mathrm{Gyr}^{-1}$
denoted $\psi_1$ above. Stars above ~$2\,M_\odot$ are no problem, because
they are so short-lived that their numbers scale directly with
$\psi_1$ (Equation 2.7), but the ages of less massive stars are unknown,
as is the mass fraction contained in objects below
$0.1\,M_\odot$. Most of the methods that have been suggested for
overcoming these problems reduce in principle to estimating
the average past SFR ($\bar\psi_1$) and then the ratio $\bar\psi_1/\psi_1$.$^5$ The
following two methods are typical. 


(i) A dynamical estimate of the total surface density in the 
Solar neighborhood, called the “Oort limit”, can be derived 
from motions of stars perpendicular to the Galactic plane; 
its value is probably ~80–100 $M_\odot\,\mathrm{pc}^{-2}$ (Oort 1965). Of the
total surface density, ISM in various forms accounts for 5–10
$M_\odot\,\mathrm{pc}^{-2}$ (Gordon & Burton 1976), so stars are presumed
to contribute ~70–95 $M_\odot\,\mathrm{pc}^{-2}$ (neglecting any possible non-stellar
condensed objects). This is not the total mass of stars
ever formed, because some fraction $R$ has been ejected back to
the ISM by evolving and dying stars; $R$ depends on the IMF,
whose normalization is not known without $\psi_1$ itself, but all
reasonable normalizations give $R$ in the range ~0.1–0.3 (as
shown in Section 3.1). The mass of stars ever formed is thus
10%–30% greater than the mass now present; the mean past
SFR is thus given by $75 < \bar\psi_1 t_1 < 125\,M_\odot\,\mathrm{pc}^{-2}$.

The next step is to use the ratio $T_1$ (Equation 2.8), derived
by joining smoothly the IMFs with different multiplicative
factors for stars above $2\,M_\odot$ and below $1\,M_\odot$, respectively.
Plausible values of $T_1$ lie between 5 and 20 Gyr (Tinsley 1976;
Miller & Scalo 1979), so we are left with the following range of
values for $\psi_1$ itself: $1.5 < \psi_1 < 25\,M_\odot\,\mathrm{pc}^{-2}\,\mathrm{Gyr}^{-1}$. The uncertainties
at various stages of the derivation are mutually independent,
and the 20% uncertainty in $R$ is relatively unimportant.
A more stringent upper limit could be obtained if one could
assume that the SFR has not increased since the time the
oldest disk stars formed, ~12 Gyr ago; then $T_1 > 12$ Gyr and
$\psi_1 < 10\,M_\odot\,\mathrm{pc}^{-2}\,\mathrm{Gyr}^{-1}$. However, although the assumption of
a monotonically decreasing SFR is common, recent work on
the age distribution of disk stars (Twarog 1980) shows that
the SFR more likely increased between 12 and 5 Gyr ago, and
was approximately constant in the last ~5 Gyr. In this case,
$T_1$ is probably between about 5 and 10 Gyr, and the above
limits on $\bar\psi_1 t_1$ lead to $8 < \psi_1 < 25\,M_\odot\,\mathrm{pc}^{-2}\,\mathrm{Gyr}^{-1}$. Values only
~10–20% of these have usually been derived in the literature,
on the assumption that $\psi(t)$ has been a decreasing function
for at least 10 Gyr.
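
The arithmetic of method (i) can be sketched as follows, using $\psi_1 = \bar\psi_1 t_1 / T_1$ from Equation (2.8) and treating the two ranges as independent. The function name is an illustrative assumption; the input ranges are the ones quoted in the text.

```python
def psi1_range(mass_formed_range, t1_scale_range):
    """Bounds on psi_1 = (psibar_1 * t_1) / T_1.

    mass_formed_range: (low, high) of psibar_1 * t_1 in Msun pc^-2.
    t1_scale_range:    (low, high) of T_1 in Gyr.
    Returns (low, high) of psi_1 in Msun pc^-2 Gyr^-1.
    """
    m_lo, m_hi = mass_formed_range
    t_lo, t_hi = t1_scale_range
    return m_lo / t_hi, m_hi / t_lo


MASS_FORMED = (75.0, 125.0)  # total mass of stars ever formed, from the Oort limit

# SFR rising and then roughly constant: T_1 between about 5 and 10 Gyr.
rising_then_flat = psi1_range(MASS_FORMED, (5.0, 10.0))

# SFR never increasing: T_1 > 12 Gyr gives only an upper limit.
non_increasing_upper = MASS_FORMED[1] / 12.0
```

The first case gives 7.5 to 25, which the text rounds to 8 to 25; the second gives an upper limit near 10.4, quoted as 10.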


5 Another approach in the literature is to find the formation rate 
of relatively massive stars (above $2\,M_\odot$ or a greater limit), and
to scale to the total SFR via an assumed IMF. Obviously, the 
problems mentioned above are sidestepped rather than solved, so 
the conclusions should be regarded as correspondingly uncertain. 


(ii) Instead of using the Oort limit to estimate the total 
mass of stars ever formed, one can use star counts to obtain 
the existing population (and to allow for the total mass of 
stars that have evolved away in the time since they formed) 
down to ~$0.1\,M_\odot$, and neglect any less massive objects. In
this way, Miller & Scalo (1979) derive a range of values
$43 < \bar\psi_1 t_1 < 144\,M_\odot\,\mathrm{pc}^{-2}$. Combining these values with appropriate
limiting values of $T_1$, they find self-consistent values in the
range $3 < \psi_1 < 7\,M_\odot\,\mathrm{pc}^{-2}\,\mathrm{Gyr}^{-1}$.


Method (i) above gave larger values than those of Miller 
& Scalo (1979) for two reasons: the estimated mass of stars 
now present above $0.1\,M_\odot$ falls short of estimates based on the
Oort limit (although the discrepancy may not be significant),
and Miller & Scalo (1979) considered both decreasing and
increasing SFRs. Altogether, it is plausible to suggest that
$\psi_1$ lies within a factor of about two of $10\,M_\odot\,\mathrm{pc}^{-2}\,\mathrm{Gyr}^{-1}$.


2.3.2 The SFR elsewhere 


Estimates of the SFR in other places use various indicators 
of the numbers of fairly massive stars, and convert these to 
a total SFR by means of an assumed IMF, which is usually 
the local function (or an assumed power-law approximation). 
Methods that have been used include the following. 


(i) The brightest individual stars, supergiants, and occa- 
sionally upper-MS stars, can be counted in nearby galaxies. 
Apart from complications due to differences in chemical com- 
position, the numbers of massive stars in given stages of evo- 
lution should be directly proportional to the SFR, if the IMF 
is the same everywhere; their numbers relative to counts of 
similar stars in the Solar neighborhood thus scale fairly di- 
rectly with the SFR. Lequeux (1979) has applied this method 
to galaxies of the Local Group, using various types of stars, 
and several authors (e.g. Searle et al. 1973; Larson & Tinsley 
1974) have used such rare objects as supernovae in the same 
way for more distant galaxies. 


(ii) If hot young stars are present in a region, they generally
emit most of the Lyman continuum photons that are
available for ionizing interstellar hydrogen. The flux of Hα,
Hβ, or free–free radio emission is therefore approximately proportional
to the SFR. This method has been applied to external
galaxies by Cohen (1976) using integrated Hα equivalent
widths, and by Huchra (1977) using Hβ; Smith et al. (1978)
have studied SFRs in the Galaxy using radio emission from
H II regions.


(iii) The integrated colors of galaxies depend strongly on 
their relative proportions of young and old stars; as a result, 
UBV colors in general provide a rough estimate of the ratio 
(SFR) / (mass of old stars). This approach will be discussed 
in detail in Section 7. 


(iv) Regions of star formation are often heavily obscured 
by dust, but even so the infrared luminosity (starlight re- 
radiated by dust grains) gives a measure of the SFR, as dis- 
cussed in Section 7.2.2. 


Most of these methods rely strongly on the assumption of 
a universal IMF, since they sample only the very massive 
stars; integrated colors can reveal large departures from an 
assumed IMF (Section 2.2.2), but even these are completely 
blind to stars below ~$0.5\,M_\odot$. Even granted the assumption
of universality, important uncertainties arise in scaling from
the observations to a total SFR, because the statistics of very 
massive stars are poor. Large systematic errors can arise from 
the use of a single power law for the whole IMF (a common 
practice), as can be clearly seen from Figure 4. 

Despite the uncertainties, the results of various studies have 
led to a coherent picture. In the Galaxy, some 10% of the 
current star formation is occurring in the innermost 1 kpc, 
and most of the remainder is concentrated in a ring between 
5 and 8 kpc from the center, which is the site of most of the 
Galaxy’s giant molecular clouds, infrared emission, and other 
signs of intense star formation. (There are several relevant 
reviews in the symposium volume edited by Burton 1979). 
Morphologically normal galaxies show a systematic trend of 
SFR with type, which can be described as a progression from 
a very short timescale for star formation in the earliest types 
to very long in the latest. Some peculiar galaxies appear to be 
undergoing intense bursts of star formation. These properties 
will be discussed later in Section 7. 


2.3.3 Factors affecting the SFR


An obvious question to ask is why stars form at various rates 
in various places. Factors affecting SFRs have been reviewed 
by Larson (1977b), and some salient points will be mentioned 
here. 


A popular assumption, following Schmidt (1959), is that 
the SFR varies as a power n of the gas density. However, at- 
tempts to determine n empirically run into two problems: n 
appears to vary within and among galaxies, indicating that 
the gas density is not the only relevant quantity; and the 
very definition of “gas density” is ambiguous since it depends 
strongly on the spatial resolution of the observations. Con- 
sequently, a power-law formula for the SFR cannot be given 
the status of a physical law, but it can be regarded as a useful 
parametrization in some circumstances (as in the later study 
of Schmidt 1963). 
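
The resolution problem can be illustrated with a toy calculation. This is a minimal sketch, assuming a Schmidt-type parametrization SFR $= k\rho^n$ with arbitrary units; the constant `k`, the index `n`, and the cell densities are hypothetical values chosen only for illustration.

```python
def schmidt_sfr(density, k=1.0, n=2.0):
    """Schmidt-type parametrization: SFR = k * density**n (arbitrary units)."""
    return k * density ** n


def total_sfr(cell_densities, k=1.0, n=2.0):
    """Sum the parametrized SFR over equal-volume cells."""
    return sum(schmidt_sfr(rho, k, n) for rho in cell_densities)


# Same total gas mass seen at two resolutions (hypothetical numbers):
fine = [4.0, 0.0, 0.0, 0.0]                     # gas resolved into one dense cell
coarse = [sum(fine) / len(fine)] * len(fine)    # same gas smeared over four cells

sfr_fine_n2 = total_sfr(fine, n=2.0)
sfr_coarse_n2 = total_sfr(coarse, n=2.0)
sfr_fine_n1 = total_sfr(fine, n=1.0)
sfr_coarse_n1 = total_sfr(coarse, n=1.0)
```

For $n = 1$ the two resolutions agree, but for $n = 2$ the resolved map predicts four times the SFR of the smoothed one, so any fitted $n$ depends on how "gas density" is measured.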

The physical environment needed for star formation is ev- 
idently cool, gravitationally bound gas clouds, so the main 
question is: under what conditions does interstellar gas clump 
into clouds? Larson (1977b) argues that gas compression 
is the key requirement for star formation, and that both 
large- and small-scale dynamical processes are important. On 
small scales, interstellar shock and ionization fronts have been 
widely discussed as causes of star formation. These processes 
can be self-sustaining, since supernovae and H II regions aris- 
ing from recently-formed stars may induce further star for- 
mation nearby (e.g. Elmegreen & Lada 1977). Large-scale 
compression mechanisms that may lead to star formation in- 
clude gravitational settling of gas into a thin layer, density 
waves of bar or spiral form, high-velocity collisions between 
gas streams in young or interacting galaxies, and accretion of 
intergalactic clouds. If star formation can be self-propagating 
over long distances, differential rotation can spread the ac- 
tive regions out to produce spiral structure (Gerola & Sei- 
den 1978). Large-scale processes are especially relevant to 
the long-term evolution of galaxies, and since some of them 
involve interactions with external matter, galaxies do not nec- 
essarily evolve as independent systems. This point has been 
emphasized in a review by Saar & Einasto (1978). Further discussion
of star formation can be found in the reviews cited,
many references therein, and in a symposium volume edited
by de Jong et al. (1977).


Although some understanding has developed of the condi- 
tions under which stars form, no formula for the SFR has 
emerged that can make a useful prediction for any galaxy. 
Lynden-Bell (1977) has pointed out that a proper expression 
for the SFR would contain so many unknown parameters as 
to be useless. He wrote, “This rate $R$ (the SFR) probably depends
on gas density $\rho_g$, gas sound speed $c_s$, shock frequency
$\omega_s$, shock strength $V_r/c_s$, gas rotation $\Omega$ and shearing rate $A$,
the magnetic field strength $|B|$, the gas metal abundance $Z$,
and possibly the background star density $\rho_*$. Thus

$$R = R(\rho_g, c_s, \omega_s, V_r/c_s, \Omega, A, |B|, Z, \rho_*).$$


However, if we knew the true functional form of R and offered 
it to a galaxy builder he would probably tell us ‘Oh, go and 
jump in the lake, that’s far too complicated’.” 

The only feasible approach to galaxy building is to be less 
ambitious than to want a physical formula for the SFR. In- 
stead, schematic expressions, guided by the above ideas on 
relevant factors, can be tested to see how galactic evolution is 
affected by various parameters. Some examples already men- 
tioned just above and in Section 1 are that self-propagating 
star formation can lead naturally to spiral structure, and 
that the final shape of a galaxy depends on the ratios of the 
timescales for star formation, collapse, and gaseous dissipa- 
tion in the protogalaxy. Many examples of schematic expres- 
sions for the SFR, in models for chemical and photometric 
evolution, will be considered later. 


2.4 Stellar Evolution Beyond the Main Sequence 


Late stages of stellar evolution are important to galaxies be- 
cause stars burn much of their nuclear fuel after leaving the 
main sequence. The products of this late fuel guzzling pro- 
vide most of the chemical enrichment of galaxies, and the 
energy liberated provides most of their integrated light. The 
following outline of stellar evolution and nucleosynthesis em- 
phasizes aspects that will be relevant later in the discussion 
of galaxies, and necessarily only skims the surface of these 
topics. Further details and references can be found in the 
comprehensive review by Trimble (1975), and in other pa- 
pers cited below. 


2.4.1 Stars near Solar mass 


Giant stars of approximately Solar metallicity and mass are 
especially interesting because they provide most of the light 
of elliptical and S0 galaxies and the nuclear bulges of spirals. 
The dominant old-disk population of the Solar neighborhood 
provides a convenient sample of such stars for detailed study; 
these may be somewhat younger than most stars in spheroidal 
systems, but evolution on the giant branch is thought to be 
rather insensitive to the initial stellar mass in the small range 
whose lifetimes are ~5–15 Gyr.


Figure 5 gives color-magnitude diagrams of the old open 
cluster M67 and a sample of field giants, which are necessary 
for completeness because the cluster is so poor that it contains 
none of the short-lived M giants. Selection effects in the field 
sample mean that earlier giants are badly under-represented

Figure 5. Color-magnitude diagrams for the old open cluster M67
(Racine 1971; filled circles) and for red giants in old-disk groups
(Eggen et al. 1962; Tinsley & Gunn 1976b; open circles); crosses
denote variable stars. The M67 data have been reduced to $M_\mathrm{bol}$
versus $R-I$, as in Tinsley & Gunn (1976b). This diagram illustrates
the types of stars that apparently contribute most of the
light in old, metal-rich systems, such as giant elliptical galaxies
and the nuclear bulges of spirals. Because of the small total cluster
population, there are no late giants in M67, and because of selection
effects the visually brightest stars are over-represented in the
groups.


relative to later types, and an unbiased luminosity function 
would show roughly a factor of two decrease in the number of 
stars per bolometric magnitude interval, for each magnitude 
increase in luminosity up the giant branch. Some relevant as- 
pects of the stellar evolution responsible for this distribution 
of stars are as follows (e.g. Iben 1974; Tinsley & Gunn 1976b): 


(i) When the central 10% of hydrogen has been converted 
to helium, the star leaves the MS and its site of hydrogen 
burning moves from the center to a shell around the helium 
core; as the shell is established, the star moves along the 
subgiant branch (at $M_\mathrm{bol} = 2.5$ in Figure 5) to the base of the
giant branch, at which point the helium core mass is ~$0.2\,M_\odot$.


(ii) During the first ascent of the giant branch to the so-called
“tip” at $M_\mathrm{bol} \approx -3$, the helium core grows to ~$0.45\,M_\odot$.
Although this ascent occupies only about 10% of the preced- 
ing lifetime, the energy emitted, which is nearly proportional 
to the mass of hydrogen burned, is greater. 


(iii) Helium ignition in the degenerate core (the “helium
flash”) halts the ascent of the giant branch, and the star
very quickly drops to a lower luminosity, $M_\mathrm{bol} = 0.5$, where
it remains for about $10^8$ yr while the core helium is burned
to carbon (and a little oxygen). This stage appears as the
horizontal branch to the left of the giant branch in HR diagrams
of globular clusters, but most metal-rich stars form
a clump of early K giants superposed on the ascending giant
branch; some metal-rich blue horizontal-branch stars are
found in the old-disk population, and these may represent a 
minority that lost a large fraction of their envelopes at the 
helium flash (e.g. Butler et al. 1976). Although the total en- 
ergy emitted by core-helium-burning stars is relatively small, 
blue horizontal-branch stars can make an important contri- 
bution to the light of galaxies whose bluest stars otherwise 
are at the turnoff. 


(iv) The star then ascends the giant branch for a second 
time (called the asymptotic giant branch stage because it ap- 
pears as an asymptote to the left of the first giant branch in 
the HR diagrams of globular clusters), with two shells burn- 
ing hydrogen and helium respectively. The star can become 
brighter and cooler than the first giant branch “tip”, as seen 
in Figure 5 but not in all HR diagrams for globular clusters. 


(v) The end of this stage of evolution is determined by 
mass loss, without which the whole star would become a 
carbon-oxygen white dwarf. In fact, there is much evidence 
that several tenths of a solar mass of envelope are lost be- 
fore the burning shells reach out to consume the whole initial 
mass of hydrogen. For example, the average mass of white 
dwarfs is ~$0.7\,M_\odot$, well below the turnoff mass (~$1\,M_\odot$) for
field stars; planetary nebulae, most of which belong to the
old-disk population, represent ~$0.2\,M_\odot$ of ejected matter; and
spectroscopic observations of M giants show that they are los- 
ing mass in a wind. The existence of both very late M giants 
and blue horizontal-branch stars in the old-disk population 
suggests that mass loss occurs at a variety of rates among 
stars of a given initial mass and composition. Since the av- 
erage final core mass of solar-mass stars must be the white 
dwarf mass, ~$0.7\,M_\odot$, the energy liberated during the second
ascent of the giant branch is comparable to that liberated 
on the first ascent, with the difference that most of it now 
appears in the near infrared from very cool stars. 
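
The energy comparison in stages (i) to (v) can be sketched from the growth of the core mass, under the text's statement that the energy emitted is nearly proportional to the mass of hydrogen burned. The sketch below assumes that hydrogen burning releases about 0.7% of the rest mass, and it ignores the smaller energy yield of helium burning; the constant and variable names are illustrative.

```python
# Core masses (solar units) at the stages quoted in the text.
CORE_MASS_MSUN = {
    "base of giant branch": 0.2,
    "first giant branch tip": 0.45,
    "final white dwarf": 0.7,
}

H_BURNING_EFFICIENCY = 0.007  # assumed fraction of rest mass released, H -> He
MSUN_KG = 1.989e30
C_M_PER_S = 2.998e8


def burning_energy_joules(delta_core_msun):
    """Energy from adding delta_core_msun of hydrogen-burning ash to the core."""
    return H_BURNING_EFFICIENCY * delta_core_msun * MSUN_KG * C_M_PER_S ** 2


before_giant_branch = CORE_MASS_MSUN["base of giant branch"]
first_ascent = (CORE_MASS_MSUN["first giant branch tip"]
                - CORE_MASS_MSUN["base of giant branch"])
second_ascent = (CORE_MASS_MSUN["final white dwarf"]
                 - CORE_MASS_MSUN["first giant branch tip"])

energy_first_ascent = burning_energy_joules(first_ascent)
energy_second_ascent = burning_energy_joules(second_ascent)
```

The core grows by about 0.25 solar masses on each ascent, compared with 0.2 solar masses before the giant branch, which is consistent with the statements that the first ascent emits more energy than the preceding lifetime and that the second ascent is comparable to the first.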


Current stellar models do not predict post-MS evolution 
accurately enough to be used directly in synthetic galaxies. 
The main uncertainties are the depth of the surface convec- 
tion zone, which determines the effective temperature at a 
given luminosity, and mass loss; a related problem is the out- 
ward mixing of products of nucleosynthesis, which are ob- 
served on the surfaces of many giants but cannot be predicted 
in detail. Consequently, galaxy models must rely heavily on 
empirical studies of giant populations. The implications of 
these problems for understanding photometric properties of 
elliptical galaxies are reviewed by Faber (1977). Mass loss 
and mixing also mean that even these low-mass stars con- 
tribute significant amounts of newly synthesized elements to 
the ISM, including for example carbon and nitrogen, which 
are often overabundant at the surfaces of giants and in planetary
nebulae. The isotopes $^{13}$C and $^{14}$N are by-products of
hydrogen burning by the CNO cycle, while $^{12}$C must be a
product of core helium burning (e.g. Trimble 1975; Iben & 
Truran 1978). 


2.4.2 Stars of 1–4 $M_\odot$


White dwarfs appear in open clusters where their progenitors
must have been stars with initial masses up to ~$4\,M_\odot$
(e.g. Tinsley 1975a). The substantial mass loss required is 
in agreement with stellar models that allow for winds (Fusi- 
Pecci & Renzini 1976; Mengel 1976); these models suggest

Figure 6. Composite HR diagrams for (mainly) post-main-sequence stars in galactic clusters grouped by age (Harris 1976). Ages ($t$ in yr)
are indicated. These diagrams illustrate the distribution of supergiants expected in galaxies containing stars of the appropriate ages.


that stars up to some limit ~3–6 $M_\odot$ lose so much mass that
they fizzle before the point of carbon ignition, which would
otherwise occur (possibly explosively) when the degenerate
core reached $1.4\,M_\odot$. Therefore, stars with MS lifetimes in
the whole range from ~$10^8$ to ~$10^{10}$ yr, from mid-B to early
G on the MS, evolve internally similarly to the solar-mass 
stars discussed above, and they make similar contributions 
to nucleosynthesis. 


2.4.3 Stars above 8 $M_\odot$


Stars initially more massive than ~$8\,M_\odot$, which are early B
and O stars on the MS, are believed to ignite carbon (qui- 
etly) in a non-degenerate core and to proceed through many 
stages of nuclear burning. Models acquire an onion-skin struc- 
ture of successive elements, with unaltered envelope material 
(mostly hydrogen) on the outside, then layers of helium, car- 
bon, oxygen, neon, magnesium, etc., with the ashes of each 
nuclear reaction providing fuel for the next. These stars and 
their contributions to nucleosynthesis are discussed by Ar- 
nett (1978), Weaver et al. (1978), and Chiosi (1979), the last 
stressing the very important effects of mass loss. Theoretical 
models predict that the stars undergo core collapse after the 
last exoergic reaction, formation of iron-peak elements, and 
the event is expected to send a shock wave through the star 
leading to explosive nucleosynthesis of most of the less abundant
elements. Because Type II supernovae are associated
with the most massive stars in galaxies, they are believed
to represent these explosive stellar deaths. The lower initial
mass limit for stars with this fate is somewhere in the range
6–10 $M_\odot$, depending on the structure of stellar cores at carbon
ignition; it is sometimes suggested that there is also an upper 
limit for such deaths, stars above the higher limit collapsing 
to black holes that swallow most of the new elements (e.g. 
Wheeler 1978a). 


The evolution of massive stars in the HR diagram (as well 
as internally) must be strongly affected by mass loss, since 
winds of ~$10^{-6}\,M_\odot\,\mathrm{yr}^{-1}$ are inferred from the spectra of MS O
stars and much higher mass-loss rates are common in super- 
giants (Conti 1978). These rates mean that the stellar mass 
is changing significantly on an evolutionary timescale. The 
evolutionary tracks of such stars in the HR diagram cannot 
be predicted reliably (e.g. Chiosi et al. 1978; Stothers & Chin 
1978), so models for galaxies with young stars are subject 
to some uncertainty. Empirical illustrations of the evolution- 
ary tracks of massive stars are provided by compilations of 
HR diagrams of open clusters in age groups (Harris 1976), of 
which the youngest few are reproduced in Figure 6. The gen- 
eral trends with age are clear, but there are obviously many 
uncertainties due to the small numbers of supergiants, and 
a scatter that may be due partly to the presence of clusters 
with different metallicities. 




2.4.4 Stars of intermediate mass 


There may be some stars of mid-B type on the MS, with 
masses in the interval ~4–8 $M_\odot$, that neither die quietly
as white dwarfs nor follow the sequence of nuclear burning 
stages through core collapse and (Type II) supernova explo- 
sion. Such stars would be characterized by carbon ignition in 
a degenerate core, and it is possible that they do not exist if 
the upper limit for white dwarf formation exceeds the lower 
limit for non-degeneracy. If they do exist, their final evolution 
is very uncertain, since it is not known whether degenerate 
carbon ignition would blow a star apart or lead quietly to fur- 
ther burning; this problem is reviewed by Wheeler (1978a). 

Regardless of their final fate, stars of about 4-8 $M_\odot$ (footnote 6) make
important contributions to galactic enrichment in elements 
that are produced during hydrogen and helium burning and 
are mixed up to the surface. Overabundances of carbon, ni- 
trogen, barium and other elements are seen in the spectra of 
many giants in this mass range, and their theoretical origin 
is discussed by Iben & Truran (1978). 

The evolution of stars between 4 and 8 $M_\odot$ in the HR diagram
is subject to the same uncertainties that beset more
and less massive stars. The groups of clusters in Harris (1976) 
again give a useful guide, but they are rather sparse, as seen 
in Figure 6. 


About half of all supernovae in spiral galaxies are Type I
supernovae (SNI), which, because of their sites in galaxies,
are associated with stars below ~6 $M_\odot$. Their origin has been
controversial for some time, as reviewed by Tammann (1974, 
1977), Tinsley (1975a, 1977a), and Wheeler (1978a). This 
leaves a serious gap in our understanding of stellar deaths 
and nucleosynthesis, especially since SNI appear to eject sig- 
nificant quantities of iron (Kirshner & Oke 1975). Alternative 
theories include the following. 


(i) SNI are exploding white dwarfs. This has long been 
the standard theory, because SNI occur in elliptical galaxies 
where only stars of ~1 $M_\odot$ are conventionally believed to be dying.
However, the white dwarf theory conflicts with the large
envelopes (of supergiant size and $\gtrsim 1\,M_\odot$) that are invoked to
explain the early spectra and light curves of SNI, and with 
the clear association of SNI with a young stellar population 
in spiral and irregular galaxies (Oemler & Tinsley 1979). 


(ii) SNI are stars of ~2 $M_\odot$ that live for ten times longer
than normal by mixing completely on the MS. Wheeler 
(1978b) proposed this model to be consistent with the en- 
velope masses, populations in elliptical galaxies, the extreme 
deficiency of hydrogen in SNI spectra, and some properties of 
the Crab nebula. Its problems are that occasional complete 
mixing of MS stars must be postulated ad hoc, and that often 
SNI are associated with a much younger stellar population. 


(iii) SNI are stars of intermediate mass, ~4-6 $M_\odot$. This was
suggested by Renzini (1976) on the grounds that these stars 
could blow away their hydrogen-rich envelopes before dying, 
and by Oemler & Tinsley (1979) to account for the association 
of SNI with ongoing star formation in spiral and irregular 
galaxies. This hypothesis implies a rate of star formation in 
elliptical galaxies that should be marginally detectable. 


(iv) There are two kinds of SNI, those from low-mass stars
that appear in elliptical galaxies, and those from more massive
stars that account for the frequency in spiral and irregular
galaxies (Dallaporta 1973). While this hypothesis avoids
the problems of the preceding ones, the properties of SNI
(while not exactly homogeneous) do not divide naturally into
two classes suggestive of very different stellar origins.

Footnote 6: Values of stellar masses will from now on be understood to be
initial values, unless otherwise stated.


In any case, the production of iron by SNI and the red- 
giant nucleosynthesis mentioned above show that stars with 
a very wide range of masses contribute to galactic enrichment 
in various elements. 


2.4.5 Effects of initial composition 


Stars now forming and evolving in different regions of galax- 
ies have at least a ten-fold range of metallicities, and the 
first stars everywhere were presumably very metal-poor. It is 
therefore important to ask how stellar evolution is affected 
by initial composition. 


The evolution of stars of ~0.7-1 $M_\odot$ has been widely
studied as a function of metallicity (Z) and helium abundance
(Y), for the interpretation of HR diagrams of globular clusters
(e.g. Iben 1974; Mengel et al. 1979; Sweigart & Gross 1978).
Much less attention has been paid to more massive stars. In
the largest systematic survey to date, stars up to 7 $M_\odot$ with a
wide range of Z and Y have been followed to the base of the
giant branch (Mengel et al. 1979); Sweigart & Gross (1978)
have followed some of these up the giant branch. A few more
massive models have been studied by Trimble et al. (1973)
and Harris & Deupree (1976), and Alcock & Paczynski (1978)
have followed the evolution of stars up to 10 $M_\odot$, with various
Z, through core helium exhaustion. The results of these
calculations suggest that important systematic differences in 
nucleosynthesis may occur as a result of initial Z differences, 
but they do not go far enough to provide quantitative esti- 
mates. Models for chemical evolution usually assume that the 
production of primary elements — those that are made from 
hydrogen and helium that were initially in the star — is a func- 
tion of stellar mass alone, but this assumption may be lead- 
ing to systematic errors. (An exception is the study under- 
taken by Arnett 1971 of explosive nucleosynthesis of certain 
neutron-rich primary elements which may depend systemat- 
ically on the star’s initial Z; Arnett 1971 showed how such 
effects could be tested using relative abundances of metals in 
metal-poor stars). 


Some elements are believed to be synthesized in amounts 
that depend directly on the initial composition of the star, 
because they are made not only from hydrogen and helium 
but also from heavier elements that were present initially. 
These are called secondary elements, and some examples are: 


(i) $^{13}$C and $^{14}$N are made in stellar envelopes as by-products
of hydrogen burning in the CNO cycle, mainly from
$^{12}$C that was present initially (footnote 7).


(ii) Barium is made from the addition of neutrons to iron
nuclei, probably in the intershell region of asymptotic-branch
giants (e.g. Iben & Truran 1978).

Footnote 7: If any fresh $^{12}$C, a product of helium burning in the star’s own
core, gets mixed to the hydrogen-burning region and processed
in this way, the resulting $^{13}$C and $^{14}$N would be called primary
products.


Because the structure of stars at different stages of evo- 
lution depends on their initial composition, these and other 
elements may also be affected indirectly by initial composi- 
tion. 

Extensive calculations of stellar models lie ahead before the 
dependence of both primary and secondary nucleosynthesis 
on composition is adequately understood. Almost all calcula- 
tions to date have used Solar relative abundances of the heavy 
elements (all scaled with the single initial metallicity param- 
eter Z), and it will be important also to find out how stellar 
evolution and nucleosynthesis depend on variations among 
these relative abundances. 

Another set of questions concerns changes in the HR di- 
agram that would affect the colors of galaxies with stars of 
different compositions. A population of low Z is in general 
bluer than normal for two reasons: the effective temperature 
of a star of a given mass and evolutionary stage is (in most 
cases of interest) higher at lower Z, because metal-poor stars 
tend to have smaller radii; and most colors at a given effec- 
tive temperature are bluer at lower Z, because the reduced 
line-blanketing restores more flux at short wavelengths than 
at long wavelengths. These effects are partially offset when 
one considers populations of a given age, because the turnoff 
mass is lower at lower Z, but this factor is relatively unim- 
portant. Some implications of these changes will be discussed 
in Section 5.1 and Section 7.1.4. 


3 AIMS AND METHODS OF CHEMICAL EVOLUTION 


Studies of chemical evolution aim to account for abundance 
distributions of the elements, including the variation of stel- 
lar metallicities with age and position in the Galaxy, abun- 
dance gradients in galaxies, variations in relative abundances 
of elements heavier than helium, and related observations. 
The main processes governing chemical evolution are indi- 
cated in Figure 1: star formation, nucleosynthesis, mass loss 
from evolving and dying stars, and gas flows. Because of the 
importance of dynamical processes, another motivation for 
studying chemical evolution is that it leads to clues to the dy- 
namical evolution of galaxies, as examples in Sections 4 and 5 
will show. Details of the basic processes have been exten- 
sively reviewed elsewhere, so the emphasis here will be on 
methods of modeling chemical evolution, and selected ap- 
plications. Reviews for further background material include 
Audouze & Tinsley (1976), Faber (1977), Pagel (1978a,b), 
Trimble (1975), van den Bergh (1975), and others cited in 
Section 2. There is no clearer introduction to chemical evolu- 
tion than the seminal paper by Schmidt (1963) on the subject. 


3.1 Basic Assumptions and Equations 


Models for chemical evolution follow, analytically or numeri- 
cally, abundance changes in the ISM of a region and the re- 
sulting abundance distributions in stars. For most purposes, 
the region under study in a particular model can be assumed 
to have ISM of uniform composition and to gain or lose mass 
through gas flows only; complicated models for whole galaxies
are generally divided up, for computation, into zones with
these properties. The basic equations are then easy to derive. 

The total mass M of the system changes according to the
net inflow rate f of gas:

$$ \frac{dM}{dt} = f \tag{3.1} $$

(f is often called the infall or accretion rate). The mass of
stars $M_s$ changes via star formation and mass loss from evolving
stars:

$$ \frac{dM_s}{dt} = \psi - E, \tag{3.2} $$

where $\psi$ is the SFR and E is the total ejection rate from stars
of all masses and ages. Similarly, the mass of ISM, called
“gas” ($M_g$), changes through star formation, ejection, and net
inflow into the region:

$$ \frac{dM_g}{dt} = -\psi + E + f. \tag{3.3} $$

Since $M = M_s + M_g$, the sum of Equations (3.2) and (3.3) is
simply (3.1).

The fraction of gas in the system is written

$$ \mu \equiv \frac{M_g}{M}, \tag{3.4} $$

so the mass of stars is obviously

$$ M_s = (1 - \mu) M. \tag{3.5} $$


The ejection rate can be written in terms of the IMF and
SFR, Equation (2.1), if one makes the usual approximation
that each star undergoes its entire mass loss after a well-defined
lifetime. Let $w_m$ be the remnant mass and $\tau_m$ the
lifetime of a star of (initial) mass m; the star was therefore
formed at time $(t - \tau_m)$ if it dies at time t. Then the ejection
rate at time t is

$$ E(t) = \int_{m_t}^{\infty} (m - w_m)\, \psi(t - \tau_m)\, \phi(m)\, dm, \tag{3.6} $$

where $m_t$ is the mass with $\tau_m = t$ (the turnoff mass) and $\phi(m)$
is the IMF at time $(t - \tau_m)$. The approximation of sudden mass
loss at the end of each star’s lifetime is usually a reasonable
one, since for most stars nearly all of the mass loss is thought
to occur in a final small fraction of their lives; mass loss on the
MS is important for O stars, however, so the approximation
fails if timescales < $10^7$ yr are of interest.
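
As an illustration of Equation (3.6), the following Python sketch evaluates the ejection rate by direct quadrature over stellar mass. It is a sketch under stated assumptions, not a calibrated model: the power-law IMF (slope 2.35 between 0.1 and 50 solar masses), the lifetime law of 10 Gyr times m^-2.5, and all function names are hypothetical choices made here for illustration. Only the remnant masses (0.7 and 1.4 solar masses, divided at 4 solar masses) follow values quoted in this Section.

```python
import math

# Assumed, illustrative inputs (not taken from the article):
M_LO, M_HI = 0.1, 50.0   # IMF mass limits in solar masses
SLOPE = 2.35             # phi(m) proportional to m**(-SLOPE)
# Normalization so that the integral of m * phi(m) dm over [M_LO, M_HI] is 1.
IMF_NORM = (M_HI ** (2 - SLOPE) - M_LO ** (2 - SLOPE)) / (2 - SLOPE)


def imf(m):
    """IMF phi(m), normalized to unit total mass."""
    return m ** (-SLOPE) / IMF_NORM


def lifetime(m):
    """Assumed stellar lifetime tau_m in Gyr for initial mass m."""
    return 10.0 * m ** (-2.5)


def turnoff_mass(t):
    """Mass m_t whose assumed lifetime equals t (t in Gyr, t > 0)."""
    return (t / 10.0) ** (-0.4)


def remnant_mass(m):
    """Remnant mass w_m: 0.7 below 4 solar masses, 1.4 at and above."""
    return 0.7 if m < 4.0 else 1.4


def ejection_rate(t, sfr, n=2000):
    """E(t) of Equation (3.6); sfr is a function of time giving psi."""
    if t <= 0.0:
        return 0.0
    m_t = max(turnoff_mass(t), M_LO)
    if m_t >= M_HI:
        return 0.0  # no star has had time to die yet
    log_lo, log_hi = math.log(m_t), math.log(M_HI)
    total = 0.0
    prev_m = prev_val = None
    for i in range(n + 1):
        m = math.exp(log_lo + (log_hi - log_lo) * i / n)
        t_birth = max(t - lifetime(m), 0.0)      # guard against rounding
        ejected = max(m - remnant_mass(m), 0.0)  # mass returned per star
        val = ejected * sfr(t_birth) * imf(m)
        if prev_m is not None:
            total += 0.5 * (val + prev_val) * (m - prev_m)  # trapezoid
        prev_m, prev_val = m, val
    return total


# Constant SFR of one mass unit per Gyr, sampled at three epochs.
for t in (0.1, 1.0, 10.0):
    print(t, ejection_rate(t, lambda t_birth: 1.0))
```

For a constant SFR the ejection rate grows with time as the turnoff mass falls and more stars contribute to the integral. Any other star formation history can be passed in as the `sfr` argument.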


For illustration, let us consider the evolution of a single 
“metal” abundance parameter Z, where “metals” are one or 
all of the common primary elements (e.g., C, O, Fe), and let 
us assume that their production is a function only of stellar 
mass, not of initial composition. (The methods for calculating 
this schematic Z are readily adapted to more realistic cases). 
Let us also assume for simplicity that material ejected from 
stars is mixed instantly throughout the ISM in the region 
under study. Then the mass of metals in the gas, which is
$Z M_g$, evolves via star formation (putting metals from the ISM
into stars), ejection, and gas flows, according to the equation

$$ \frac{d(Z M_g)}{dt} = -Z\psi + E_Z + Z_f f, \tag{3.7} $$

where $E_Z$ is the total ejection rate of metals from stars and
$Z_f$ is the mean metal abundance in infalling gas. $E_Z$ includes
both newly synthesized metals and those that were in the 
star from birth and re-ejected. Let $p_{zm}$ be the mass fraction
of a star of mass m that is converted to metals and ejected;
then the rate of ejection of new metals from stars at time t is

$$ \int_{m_t}^{\infty} m\, p_{zm}\, \psi(t - \tau_m)\, \phi(m)\, dm. \tag{3.8} $$

Unprocessed material occupies a mass $(m - w_m - m p_{zm})$ of the
ejected part of a star of mass m, and the metal abundance
in this region is $Z(t - \tau_m)$. Thus the total ejection rate of old
and new metals is obtained by adding the ejection rate from
all masses of these unprocessed metals to the rate (Equation 3.8):

$$ E_Z(t) = \int_{m_t}^{\infty} \left[ (m - w_m - m p_{zm})\, Z(t - \tau_m) + m p_{zm} \right] \psi(t - \tau_m)\, \phi(m)\, dm. \tag{3.9} $$


The mean metallicity of stars ever formed, denoted $Z_s$, can
be obtained by writing an equation for the conservation of
metals: a mass $Z_s M_s$ of metals is stored in stars, a mass $Z M_g$
is in the gas, and their sum is the mass of new metals ever
ejected, which is obtained by integrating Equation (3.8) over
time. Thus $Z_s$ is given by

$$ Z_s M_s + Z M_g = \int_0^t \int_{m_{t'}}^{\infty} m\, p_{zm}\, \psi(t' - \tau_m)\, \phi(m)\, dm\, dt'. \tag{3.10} $$


It is convenient to define some integrals that depend on the
IMF and parameters of stellar evolution but not on $\psi(t)$. The
returned fraction R is defined by the integral

$$ R = \int_{m_1}^{\infty} (m - w_m)\, \phi(m)\, dm, \tag{3.11} $$

where $m_1$ is the present turnoff mass, and is usually taken
to be 1 $M_\odot$ in evaluating R. Because the IMF is normalized
as in Equation (2.2), R is the fraction of mass put into stars
at a given time that is thereafter returned to the ISM in the
lifetime of a solar-mass star; for example, if the stars in a
galaxy were all formed in a single initial burst with a total
mass M, then ~$10^{10}$ yr later the mass in stellar form would be
M(1 - R). An estimate of R can be obtained using the local
IMF as approximated in Equation (2.9), with a white dwarf
mass of 0.7 $M_\odot$ as the remnant mass $w_m$ for m < 4 $M_\odot$ and
a neutron star mass of 1.4 $M_\odot$ for m > 4 $M_\odot$; the result is
$R = 0.17 \times (10\,M_\odot\,\mathrm{pc}^{-2}\,\mathrm{Gyr}^{-1} / \psi_1)$.

A second useful integral is the yield y, which is the mass of
new metals ejected (eventually) when unit mass of matter is
locked into stars. The yield can be defined for any element of
interest, and for the “metals” considered above it is given by

$$ y = \frac{1}{1 - R} \int_{m_1}^{\infty} m\, p_{zm}\, \phi(m)\, dm. \tag{3.12} $$

Again, if a mass M of stars were formed in a single initial
burst, then ~$10^{10}$ yr later, the mass of new metals ejected
would be y M (1 - R). Sometimes the quantities $p_{zm}$ are called
stellar yields, and y is called the net yield.

Although R and y are defined with a specific lower mass
limit, their values do not depend strongly on $m_1$, so they turn
out to be useful parameters in many situations. For example,
R has been used in Section 2.3.1 to relate the mass of stars
now in the Solar neighborhood to the mass ever formed, giving
a reasonable estimate even though star formation did not
occur in a single burst; and the yield will prove to be a first
approximation to the abundance of metals resulting from a
given IMF and stellar production parameters (e.g., $p_{zm}$),
independently of the detailed history of the system.
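
The integrals (3.11) and (3.12) are simple to evaluate numerically. The sketch below does so for an assumed power-law IMF, normalized to unit total mass as the text requires, and for a purely hypothetical production function `metal_fraction` standing in for $p_{zm}$. These inputs, and the function names, are assumptions made for illustration, so the resulting numbers carry no physical weight and are not expected to match the value of R quoted above for the local IMF.

```python
import math

M_LO, M_HI, SLOPE = 0.1, 50.0, 2.35   # assumed IMF limits and slope


def integrate(func, a, b, n=4000):
    """Trapezoidal rule on a logarithmic grid from a to b."""
    xs = [a * (b / a) ** (i / n) for i in range(n + 1)]
    return sum(0.5 * (func(u) + func(v)) * (v - u) for u, v in zip(xs, xs[1:]))


# Normalize so that the integral of m * phi(m) dm is 1.
NORM = integrate(lambda m: m ** (1 - SLOPE), M_LO, M_HI)


def imf(m):
    """Assumed IMF phi(m)."""
    return m ** (-SLOPE) / NORM


def remnant_mass(m):
    """w_m as quoted in the text: 0.7 below 4 solar masses, 1.4 above."""
    return 0.7 if m < 4.0 else 1.4


def metal_fraction(m):
    """Hypothetical p_zm, a placeholder and not a nucleosynthesis result."""
    return 0.1 if m >= 10.0 else 0.0


def returned_fraction(m1=1.0):
    """R of Equation (3.11) with lower mass limit m1."""
    return integrate(lambda m: (m - remnant_mass(m)) * imf(m), m1, M_HI)


def net_yield(m1=1.0):
    """y of Equation (3.12) with lower mass limit m1."""
    new_metals = integrate(lambda m: m * metal_fraction(m) * imf(m), m1, M_HI)
    return new_metals / (1.0 - returned_fraction(m1))


for m1 in (0.8, 1.0, 1.2):
    print(m1, returned_fraction(m1), net_yield(m1))
```

With these placeholder inputs, R and y change only slightly as the lower limit is moved between 0.8 and 1.2 solar masses, which is the insensitivity to $m_1$ described above.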

Many insights into chemical evolution can be obtained us- 
ing analytical approximations to the above equations, while 
for detailed modeling they can be cast into forms suitable for 
numerical computation. These techniques will be outlined in 
turn. 


3.2 Analytical Approximations 


The equations of chemical evolution are especially easy to
handle if one makes an approximation known as instantaneous
recycling: stars are divided into two classes, those that
live forever (masses less than some value $m_1$) and those that
die as soon as they are born ($m > m_1$). Thus the lower mass
limits in the integrals in Equations (3.6)-(3.10) are taken to
be the time-independent $m_1$, and the assumed instant deaths
allow the arguments $(t - \tau_m)$ to be replaced by simply (t).
(Note that the assumption now made of death immediately
after birth, so $\tau_m = 0$, is a much stronger and less realistic
assumption than the earlier one of sudden mass loss, which
simply allowed $\tau_m$ to be well-defined for each star). Equations
(3.6) and (3.9) reduce at once to the much simpler relations,


$$ E(t) = R\,\psi(t), \tag{3.13} $$

$$ E_Z(t) = R\,Z(t)\,\psi(t) + y(1 - R)\left[1 - Z(t)\right]\psi(t), \tag{3.14} $$

in which all time-dependent quantities are evaluated at the
current time t only. Since $Z \ll 1$ in all situations of interest,
Equation (3.14) can be written more simply

$$ E_Z(t) = R\,Z(t)\,\psi(t) + y(1 - R)\,\psi(t). \tag{3.15} $$

Substituting in Equations (3.2), (3.3), and (3.7), we have now

$$ \frac{dM_s}{dt} = (1 - R)\,\psi, \tag{3.16} $$

$$ \frac{dM_g}{dt} = -(1 - R)\,\psi + f, \tag{3.17} $$

$$ \frac{d(Z M_g)}{dt} = -Z(1 - R)\,\psi + y(1 - R)\,\psi + Z_f f. \tag{3.18} $$


An equation for the metal abundance Z of the gas, rather
than the mass of metals it contains, is obtained by substituting
Equation (3.17) into (3.18), using the identity
$M_g\,dZ/dt = d(Z M_g)/dt - Z\,dM_g/dt$; the result is

$$ M_g \frac{dZ}{dt} = y(1 - R)\,\psi + (Z_f - Z) f. \tag{3.19} $$

For simplicity, let us assume that the IMF is constant. (The
methods developed for a constant IMF can be adapted when
necessary to more complicated cases). The quantities R and
y are therefore constants.

Equation (3.16) now gives an expression for the mass of
stars at time t,

$$ M_s = (1 - R) \int_0^t \psi(t')\, dt' \equiv (1 - R)\, \bar{\psi}\, t, \tag{3.20} $$

where $\bar{\psi}$ denotes the mean SFR over the interval from 0 to t.
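
For reference, the substitution that leads from Equations (3.17) and (3.18) to Equation (3.19) can be written out term by term. The block below only expands the product rule and uses no assumptions beyond those already stated:

```latex
\begin{align*}
M_g \frac{dZ}{dt}
  &= \frac{d(Z M_g)}{dt} - Z \frac{dM_g}{dt} \\
  &= \left[ -Z(1-R)\psi + y(1-R)\psi + Z_f f \right]
     - Z \left[ -(1-R)\psi + f \right] \\
  &= y(1-R)\psi + (Z_f - Z) f .
\end{align*}
```

The two terms in $Z(1-R)\psi$ cancel, so star formation by itself does not change the abundance of the gas; only the ejection of new metals and the inflow of gas with $Z_f \neq Z$ do.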


The mean metallicity of stars can be obtained from Equation
(3.10), using the instantaneous recycling approximation
and the assumption of a constant IMF: the integrals with respect
to time and mass separate, and are simply $M_s/(1 - R)$
and $y(1 - R)$ respectively, so the result is $Z_s M_s + Z M_g = y M_s$,
from which

$$ Z_s = y - \frac{\mu}{1 - \mu}\, Z. \tag{3.21} $$

This result shows that $Z_s \to y$ as $\mu \to 0$, which simply states
that, when there is no gas, all the metals ever made and
ejected ($y M_s$) are incorporated into later generations of stars.
An estimate of the yield for the local IMF is therefore the
mean metallicity of stars in the Solar neighborhood (where
the gas fraction is small, $\mu \sim 0.05$), i.e., $y \approx 0.8\,Z_\odot$. The
approximation $y \sim Z_\odot$ will be useful below.

Further solutions to the above equations depend on the 
assumptions made about gas flows, so two extreme cases will 
be considered for illustration. 


3.2.1 A closed system, initially unenriched gas 


In this case, we set f = 0, and the initial values are $M_{g0} = M$,
$M_{s0} = 0$, $Z_0 = Z_{s0} = 0$. Obviously, M = constant. Time can be
eliminated as an explicit variable if Equation (3.19) is divided
by (3.17), with the result

$$ M_g \frac{dZ}{dM_g} = -y, \tag{3.22} $$

i.e.,

$$ Z = y \ln\left(\frac{M}{M_g}\right) = y \ln\left(\frac{1}{\mu}\right). \tag{3.23} $$

This solution is valid as long as $Z \ll 1$. If not, Equation (3.14)
must be used instead of (3.15); the result is then $Z = 1 - \mu^{y}$,
which reduces to (3.23) if $Z \ll 1$, and has the limit $Z \to 1$ as
$\mu \to 0$. (This formally correct limit is of course absurd, and is
a consequence of neglecting the effects of metallicity on stellar
nucleosynthesis). Original derivations of these results were
given by Talbot & Arnett (1971), and Equation (3.23) was
derived independently by Searle & Sargent (1972). Through
Equations (3.21) and (3.23), $Z_s$ can also be expressed as a
function of $\mu$.
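
The closed-model result can be checked by integrating Equations (3.16) to (3.18) directly. The sketch below uses a simple Euler scheme with f = 0 and compares the outcome with Equations (3.21) and (3.23). The star formation law (SFR proportional to gas mass, with rate constant `k`) and the values of y and R are assumptions chosen only for illustration; the point of the comparison is that the relation between Z and the gas fraction does not depend on them.

```python
import math


def closed_box(y=0.02, R=0.2, k=0.5, t_end=10.0, steps=20000):
    """Euler integration of Equations (3.16)-(3.18) with f = 0.

    Returns (mu, Z, Z_s) at t_end. All parameter values are illustrative.
    """
    dt = t_end / steps
    gas, stars = 1.0, 0.0            # M_g0 = M, M_s0 = 0 (total mass M = 1)
    gas_metals, star_metals = 0.0, 0.0
    for _ in range(steps):
        z = gas_metals / gas
        sfr = k * gas                # assumed law: psi proportional to gas mass
        locked = (1.0 - R) * sfr * dt            # net mass locked into stars
        gas -= locked                            # Equation (3.17), f = 0
        stars += locked                          # Equation (3.16)
        gas_metals += (y - z) * locked           # Equation (3.18), f = 0
        star_metals += z * locked                # metals locked into stars
    mu = gas / (gas + stars)
    return mu, gas_metals / gas, star_metals / stars


# Columns: k, gas fraction, numerical Z, analytical y*ln(1/mu), mean stellar Z_s.
for k in (0.1, 0.5, 2.0):
    mu, z, z_s = closed_box(k=k)
    print(k, mu, z, 0.02 * math.log(1.0 / mu), z_s)
```

Changing `k` changes how quickly the gas is consumed, but the numerical Z follows $y \ln(1/\mu)$ in every case, to within the discretization error of the Euler scheme.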


3.2.2 A system with infall balanced by star formation 


Here it is assumed that star formation just keeps up with the
rate of infall plus stellar gas loss:

$$ \psi = f + R\,\psi, \tag{3.24} $$

so that, from Equation (3.17),

$$ M_g = \mathrm{constant} = M_0. \tag{3.25} $$

The picture is that the total mass grows by infall, while star
formation maintains a constant gas mass. Other initial conditions
are $M_{s0} = 0$ and $Z_0 = Z_{s0} = 0$, and the infalling gas
will be assumed to be unenriched so that $Z_f = 0$. To eliminate
time in this case, we divide Equation (3.19) by (3.1), with
the result

$$ M_g \frac{dZ}{dM} = y - Z. \tag{3.26} $$

It is useful to solve this Equation in terms of a parameter $\nu$
that gives the ratio of mass accreted to the initial mass:

$$ \nu \equiv \frac{M - M_0}{M_0} = \frac{M - M_g}{M_g} = \frac{1}{\mu} - 1. \tag{3.27} $$

Then Equation (3.26) has the solution

$$ Z = y\left(1 - e^{-\nu}\right), \tag{3.28} $$

a result due originally to Larson (1972a). Equation (3.21)
again gives $Z_s$. An important difference between this infall
model and the preceding closed model is that here $Z \to y$ as
$\mu \to 0$; an equilibrium is set up between the rate of infall of
metal-free gas and the rate of enrichment by evolving stars.
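
The infall solution can be verified in the same way. The sketch below holds the gas mass fixed, ties the SFR to the infall rate through Equation (3.24), and integrates Equations (3.1) and (3.18) with unenriched infall. The exponentially declining infall history, and the values of y and R, are arbitrary assumptions; any positive f(t) could be substituted.

```python
import math


def infall_model(y=0.02, R=0.2, f0=1.0, tau=3.0, t_end=10.0, steps=20000):
    """Euler integration with the SFR tied to infall by Equation (3.24).

    The infall rate f(t) = f0 * exp(-t / tau) is an arbitrary, assumed
    history; the gas mass is held at its initial value M_0 = 1.
    Returns (nu, mu, Z) at t_end.
    """
    dt = t_end / steps
    gas = 1.0                  # M_g = M_0, constant by Equation (3.25)
    total = 1.0                # total mass M, starts as pure gas
    gas_metals = 0.0
    for i in range(steps):
        f = f0 * math.exp(-(i * dt) / tau)
        sfr = f / (1.0 - R)    # Equation (3.24): psi = f + R * psi
        z = gas_metals / gas
        # Equation (3.18) with Z_f = 0 (unenriched infall):
        gas_metals += (y - z) * (1.0 - R) * sfr * dt
        total += f * dt        # Equation (3.1)
    nu = (total - gas) / gas   # Equation (3.27)
    return nu, gas / total, gas_metals / gas


# Columns: tau, nu, gas fraction, numerical Z, analytical y*(1 - exp(-nu)).
for tau in (1.0, 3.0, 10.0):
    nu, mu, z = infall_model(tau=tau)
    print(tau, nu, mu, z, 0.02 * (1.0 - math.exp(-nu)))
```

Different infall histories lead to different values of $\nu$, but Z depends on the history only through $\nu$ and approaches y once the accreted mass exceeds a few times the initial mass.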


3.2.3 Generalities


Although the above two models are extremely schematic, 
their Equations (3.23) and (3.28) for the “metal” abundance 
of the gas have some common properties that are found in a 
wide variety of models for chemical evolution. The following 
generalizations are approximately true even when instanta- 
neous recycling is a poor approximation, but they break down 
if the IMF is time-dependent. 


(i) Z is proportional to the net yield, y. Thus any primary
elements whose stellar production parameters are indepen- 
dent of composition will have abundances in proportion to 
their respective yields. The yields in turn can be calculated 
from Equation (3.12), for a given IMF, independently of any 
model for star formation or gas flows in the system. The util- 
ity of this result is that theories of nucleosynthesis can be 
tested by comparing predicted relative yields of various ele- 
ments with their observed relative abundances. 


(ii) Z/y depends chiefly on current properties of the sys- 
tem, and is insensitive to its past history. This statement is 
exactly true in the above models because of the assumption of 
instantaneous recycling (which led to the cancellation of time 
as an explicit variable). In numerical models that allow for 
finite stellar lifetimes, the same result holds approximately, 
and relevant current properties include the gas fraction ($\mu$)
and the ratio of SFR to gas flow rate ($\psi/f$).


(iii) Z/y depends rather weakly on model-dependent quantities
such as $\mu$ and $\psi/f$. Consequently, regions of galaxies
with gas fractions differing by orders of magnitude may have 
interstellar abundances differing by only small factors. This 
prediction allows one to test whether theories of nucleosyn- 
thesis give the right absolute amounts of elements, as well 
as the right relative amounts; in particular, the yield of an 
element, calculated for the local IMF, should be in order of 
magnitude equal to its abundance in the Solar System or 
nearby stars. 
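
The weak dependence of Z/y on the gas fraction can be made concrete by tabulating the two analytical solutions, Equations (3.23) and (3.28), over several orders of magnitude in gas fraction. The short sketch below does only that; the function names are chosen here for illustration.

```python
import math


def z_over_y_closed(mu):
    """Closed model, Equation (3.23): Z / y = ln(1 / mu)."""
    return math.log(1.0 / mu)


def z_over_y_infall(mu):
    """Infall model, Equations (3.27) and (3.28): Z / y = 1 - exp(-nu)."""
    nu = 1.0 / mu - 1.0
    return 1.0 - math.exp(-nu)


# Columns: gas fraction, Z/y in the closed model, Z/y in the infall model.
for mu in (0.5, 0.1, 0.01, 0.001):
    print(mu, z_over_y_closed(mu), z_over_y_infall(mu))
```

A factor of 100 in gas fraction (from 0.1 to 0.001) changes Z/y by a factor of three in the closed model, and by far less in the infall model, where Z/y is already close to unity at a gas fraction of 0.1.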


3.3 Numerical Models 


Chemical evolution is best studied numerically in cases where 
approximations that make the analytical approach transpar- 
ent (or possible) break down. Situations in which instanta- 
neous recycling is no longer a useful approximation include 
the following: 


(i) If the SFR is a strongly decreasing function of time, 
systematic errors result from setting $\psi(t - \tau_m) = \psi(t)$; in particular,
low-mass stars formed early die later in much greater
numbers (relative to massive stars) than would be predicted 
on the basis of the current SFR. 


(ii) If one is interested in time-dependent abundance ratios 
arising from nucleosynthesis in stars of different lifetimes, the 
effects would be entirely lost by neglecting finite stellar life- 
times. Even if instantaneous recycling can be assumed, the 
analytical approach becomes intractable in models with time-dependent
IMFs, or in considerations of radioactive elements
with lifetimes of billions of years, except in very special cases. 


For numerical computation, equations such as (3.1) — (3.10) 
are expressed as differences in finite steps of time and sums 
over a grid of stellar masses. The programming required is 
little more than careful book-keeping, so the effort in models 
for chemical evolution goes not into techniques but into the 
astrophysical input. Many examples of numerical models will 
be found in the references cited in Sections 4 and 5, in the 
discussion of particular problems. 
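
The book-keeping described above can be sketched in a few dozen lines. The example below follows a single closed zone with finite stellar lifetimes, replacing the integrals in Equations (3.6) and (3.9) by sums over a grid of mass bins and the time derivatives in Equations (3.2), (3.3), and (3.7) by finite differences. Everything astrophysical in it is an assumption made for illustration: the power-law IMF, the lifetime law, the star formation law, and especially the placeholder `metal_fraction` standing in for $p_{zm}$. Only the remnant masses follow values quoted in Section 3.1.

```python
import math

M_LO, M_HI, SLOPE = 0.1, 50.0, 2.35   # assumed IMF limits and slope
N_MASS = 200                          # number of logarithmic mass bins


def lifetime(m):
    """Assumed stellar lifetime in Gyr."""
    return 10.0 * m ** (-2.5)


def remnant_mass(m):
    """Remnant mass w_m (never larger than the star itself)."""
    return min(m, 0.7 if m < 4.0 else 1.4)


def metal_fraction(m):
    """Hypothetical p_zm, a placeholder only."""
    return 0.1 if m >= 10.0 else 0.0


def mass_grid():
    """Bin masses and the fraction of newly formed stellar mass in each bin."""
    edges = [M_LO * (M_HI / M_LO) ** (i / N_MASS) for i in range(N_MASS + 1)]
    masses = [math.sqrt(a * b) for a, b in zip(edges, edges[1:])]
    weights = [m ** (1 - SLOPE) * (b - a)
               for m, a, b in zip(masses, edges, edges[1:])]
    norm = sum(weights)
    return masses, [w / norm for w in weights]


def evolve(k=0.5, t_end=10.0, dt=0.01):
    """Closed zone with finite stellar lifetimes; returns (mu, Z) at t_end."""
    masses, weights = mass_grid()
    delays = [max(1, round(lifetime(m) / dt)) for m in masses]  # in steps
    gas, stars, gas_metals = 1.0, 0.0, 0.0
    sfr_hist, z_hist = [], []
    for step in range(round(t_end / dt)):
        z = gas_metals / gas
        sfr = k * gas                      # assumed SFR law
        sfr_hist.append(sfr)
        z_hist.append(z)
        ejected = ejected_metals = 0.0
        for m, w, delay in zip(masses, weights, delays):
            if delay > step:
                continue                   # these stars have not died yet
            born = step - delay
            out = w * (m - remnant_mass(m)) / m    # ejected mass fraction
            new = w * metal_fraction(m)            # newly made metals
            ejected += out * sfr_hist[born]
            ejected_metals += ((out - new) * z_hist[born] + new) * sfr_hist[born]
        gas += (ejected - sfr) * dt                   # Equation (3.3), f = 0
        stars += (sfr - ejected) * dt                 # Equation (3.2)
        gas_metals += (ejected_metals - z * sfr) * dt  # Equation (3.7), f = 0
    return gas / (gas + stars), gas_metals / gas


for k in (0.1, 0.5):
    print(k, evolve(k=k))
```

The stored histories `sfr_hist` and `z_hist` are what distinguish this from instantaneous recycling: each mass bin returns gas with the abundance it was born with, after its own delay. Infall, a time-dependent IMF, or separate elements could be added as further terms in the same loop.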


4 CHEMICAL EVOLUTION IN THE SOLAR 
NEIGHBORHOOD 


The extensive data available on chemical compositions of 
nearby stars and H II regions, and details of the isotopic composition
of Solar System material, have led to many efforts
to build consistent models for chemical evolution in the Solar 
neighborhood. In trying to explain the abundance distribu- 
tions, these models also shed light on dynamical processes 
that have affected the local region of the Galaxy, and they 
set some constraints on theories of nucleosynthesis. 

The “neighborhood” must be carefully defined to include 
the range of environments occupied by nearby stars and their 
predecessors in their motions around the Galaxy, and to in- 
clude stars that were formed near the plane but now occupy 
a thicker disk. A small “local swimming hole” around the Sun 
(in Baade’s words) will not do. Instead, the Sun’s “nucleoge- 
netic pool” is usefully defined as a region about a kpc wide in 
the Galactic plane (to include most stellar epicyclic motions 
and periodic gas motions), extended around the Galaxy at 
the Sun’s galactocentric distance (to include passages of local 
material through spiral arms and related small-scale patchi- 
ness), and extending about a kpc out of the plane (to include 
stars whose periodic motions carry them far from the plane). 
This region is shaped like a cylindrical shell, and quantities 
like the mass of stars, SFR, etc. are expressed in units per 
square parsec, referring to an average column at the Solar 
galactocentric distance. Spatial averages over this pool can be 
interpreted as short-term time averages during the history of 
a given small amount of material, and models generally con- 
sider explicitly only timescales longer than passages between 
spiral arms, etc. (Some special studies, such as the interpretation
of elements with radioactive lifetimes ~$10^7$-$10^8$ yr, do
need to consider short-term changes). Although the region 
has been defined to include most relevant periodic motions, 
it cannot be treated as a closed system, because it is proba- 
bly affected by systematic radial gas flows in the disk and by 
infall from outside, especially at early times. 

Halo stars are sometimes treated as part of the “neighborhood”
(in which case its extent out of the plane must be many
kpc), but more often the halo is treated as a separate subsystem
of the Galaxy, whose effects on the evolution of the disk
are perturbations from outside. Properties of halo stars are
in any case relevant to the Solar neighborhood, so they will
be discussed along with the properties of disk stars.

Figure 7. The relation between age and metallicity for disk stars in
the Solar neighborhood (Twarog 1980). Boxes denote the range of
ages referred to by each point, and the ±1σ spread in [Fe/H] for the
age group. Error bars attached to the points denote the standard
error in the mean. Note that the mean metallicity increases by a
factor of less than four during the lifetime of the disk.


4.1 Outline of Relevant Data 


Chemical abundances in the Solar neighborhood and in the 
Solar System have been reviewed in detail by Trimble (1975) 
and several authors at a recent conference at Yale University. 
An outline of the points most relevant to this article will be 
given here. Because of systematic radial gradients in chem- 
ical abundances in the Galaxy (Section 5.1), the following 
remarks apply only to stars within a few kpc of the Sun’s 
galactocentric distance. 

Stars with halo kinematics are all metal-poor, with 
[Fe/H] $\lesssim -1$, whereas stars with disk motions nearly all have
[Fe/H] $\gtrsim -0.5$. (As usual, abundance ratios in square brackets
are logarithms of the ratios relative to their Solar values).
Since most halo stars are probably older than most disk stars 
(Section 1.1), their composition differences indicate a sys- 
tematic increase of metallicity with time of formation. The 
disk stars alone show a systematic increase of metallicity with 
time of formation, illustrated in Figure 7, but the trend is 
only about a factor of three over the whole lifetime of the 
disk. Open clusters show a very weak dependence of metal- 
licity on age, if any, because they are found at such a wide 
range of galactocentric distances that the radial gradient in 
metallicity obscures the time-dependence. 

A powerful tool for chemical evolution is the metallicity dis- 
tribution of all stars ever formed, which is given by samples 
of G-M dwarfs since these stars have remained on the MS 
since they formed. The distribution can be roughly approximated
by a log-normal function, with a mean $\langle$[Fe/H]$\rangle = -0.1$
and standard deviation $\sigma$([Fe/H]) = 0.2, allowing for observational
errors (Pagel & Patchett 1975; Pagel 1978a; and
references therein). Its striking characteristic is the extreme
paucity of metal-poor stars: only a few percent of stars in the
local cylinder (including halo stars) have [Fe/H] < -1. The
lack of metal-poor disk stars proves to be one of the most 
powerful constraints on models for the Solar neighborhood, 
as discussed below. 

Stellar “metallicities” are commonly denoted [Fe/H] be- 
cause most photometric indices of the abundance of heavy 
elements are sensitive to blanketing by numerous iron lines. 
Detailed abundance analyses show that other elements heav- 
ier than helium usually occur in fixed proportions, so a single 
metallicity index is often adequate. However, the exceptions 
to this uniformity are often of great interest. Certain abun- 
dance variations in evolved stars are believed to show prod- 
ucts of the stars’ own nucleosynthesis that are mixed to the 
surface and lost in a stellar wind (Section 2.4) — direct obser- 
vational evidence for enrichment of the ISM. Some elements 
also show relative abundance variations in unevolved stars, 
and in these cases the anomalies must have been present in 
the ISM from which the star formed. 


Besides these interesting abundance variations, there are 
others that should be ignored or “corrected” for in the context 
of chemical evolution. Extraneous effects include depletion of 
metals in the diffuse interstellar gas due to their incorporation 
into grains; anomalous interstellar abundance ratios, especially
between isotopes of the same element, caused by chemical
fractionation (footnote 8); and large abundance variations of various
elements on the surfaces of Ap and related stars, produced 
by differential diffusion of elements in the stellar atmospheres 
(Trimble 1975). Unfortunately, it is not always clear whether 
a particular composition anomaly is due to some such extra- 
neous effect, to nucleosynthesis in the star observed, or to 
significant composition changes in the ISM. 


4.2 The “G-Dwarf Problem” 


The metallicity distribution of long-lived disk dwarfs leads to 
a problem originally discovered by van den Bergh (1962) and 
Schmidt (1963): there are far fewer metal-poor stars than one 
predicts using the most straightforward assumptions. This 
contradiction is called the “G-dwarf problem”, although a
similar metallicity distribution has now been confirmed for M
dwarfs (Mould 1978). The straightforward assumptions involved
are the following:


(i) the Solar neighborhood can be modeled as a closed sys- 
tem; 


(ii) it started as 100% metal-free gas; 


(iii) the IMF is constant; and


(iv) the gas is chemically homogeneous at any time.

Footnote 8: This is a case of real chemistry, not in the special sense of “chemical”
evolution of galaxies. Other terminology has been suggested
to avoid the term “chemical evolution” now that real chemistry
of the ISM is becoming an important topic in astronomy, but the
alternative suggestions have their own problems. For example, “nuclear
evolution” can be confused with the evolution of the nuclei
(centers) of galaxies, and it obscures the great importance of gas
flows, in addition to nucleosynthesis, in determining the evolution
of chemical abundances in galaxies.


Any model based on these assumptions is called a “sim- 
ple” model. It is now recognized that assumptions (i) and (ii) 
are almost certainly false and that more realistic alternatives 
lead to a natural solution of the G-dwarf problem; assump- 
tions (iii) and (iv) are probably also unrealistic, but there is 
less evidence for believing that they are violated sufficiently 
to solve the problem entirely. Although the G-dwarf problem 
is no longer regarded as a difficulty in chemical evolution, 
it will be reviewed here because of its earlier importance in 
motivating many models. Moreover, a review of this prob- 
lem serves to emphasize that simple models cannot be used 
consistently in any arguments relating to chemical evolution 
in the Solar neighborhood — the G-dwarf metallicities are so 
much at variance with such models that they must ignore very 
important effects, and could lead to highly misleading results 
in other contexts. This conclusion also leads to suspicion of 
simple models for other regions of galaxies. 


The problem itself can be derived analytically, following 
Pagel & Patchett (1975), using the instantaneous recycling 
approximation and the results for a simple model in Sec- 
tion 3.2.1. Let us consider the cumulative metallicity distribu- 
tion of stars ever formed (which is represented observationally 
by dwarfs of mid-G type and later). From Equation (3.5), the 
fraction of all stars that had been made while the gas fraction
was $\geq \mu$ is

$$\frac{M_s}{M_{s1}} = \frac{1 - \mu}{1 - \mu_1}, \tag{4.1}$$

where subscripts 1 denote present values. From Equation (3.23),
these stars were formed while the metallicity of the gas was
$\leq y \ln \mu^{-1}$. The fraction of stars with metallicities
$\leq Z$, denoted $S(Z)$, is therefore

$$S(Z) = \frac{1 - e^{-Z/y}}{1 - \mu_1}. \tag{4.2}$$

It is useful to eliminate the yield using the relation
$y = Z_1 / \ln \mu_1^{-1}$ (again Equation 3.23), so Equation (4.2)
can finally be written in terms of the metallicity ratio $Z/Z_1$
and the present gas fraction $\mu_1$:

$$S(Z) = \frac{1 - \mu_1^{Z/Z_1}}{1 - \mu_1}. \tag{4.3}$$


This distribution is compared in Figure 8 with a log- 
normal approximation to the data. (The model illustrated 
has $\mu_1 = 0.05$, and the shape of the curve is insensitive to
$\mu_1$ within the range of likely values). Obviously, there are far
fewer metal-poor stars in the Solar neighborhood than pre- 
dicted by this formula. The assumptions defining the simple 
model allow only two significant quantities to define a spe- 
cific model: the IMF and the SFR. However, the IMF affects 
only the yield, which does not appear in Equation (4.3) be- 
cause metallicities have been scaled to the present value; the 
SFR was eliminated in the derivation of Equation (3.23), and 
numerical models show that very nearly the same result is 
obtained if the approximation of instantaneous recycling is 
dropped. Thus the failure of the “simple” class of models is 
independent of the particular IMF or SFR used. It may seem
at first sight that one could eliminate the problem by making
the metals rapidly using a large SFR at early times; however,
since there is initially unenriched gas and a constant IMF, a
fixed quota of metal-poor G dwarfs must form along with the
massive stars that produce metals, no matter how rapidly
it all happens.


Figure 8. Cumulative stellar metallicity distribution functions.
$S(Z)$ is the fraction of stars that have metallicities $\leq Z$,
with a maximum value $Z_1$. Solid line: log-normal representation
of the data for stars in the Solar neighborhood; $Z_1$ is taken as
$2Z_\odot$ since more metal-rich stars are very rare;
$\langle \log Z/Z_\odot \rangle = -0.1$, and
$\sigma(\log Z/Z_\odot) = 0.15$, allowing for observational errors
(e.g. Pagel 1978a). Long dashes: the “simple” model for chemical
evolution (Section 4.2). Short dashes: an extreme infall model
(Section 4.2.1). Dots: a model with a finite initial metallicity
(Section 4.2.2).
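The size of the discrepancy can be made concrete by evaluating Equation (4.3) directly. The following is a minimal sketch, not part of the original analysis; the function name `simple_model_cdf` and the sample values of $Z/Z_1$ are illustrative assumptions.

```python
def simple_model_cdf(z_ratio: float, mu1: float) -> float:
    """Equation (4.3): fraction of stars with metallicity <= Z in the simple model.

    z_ratio is Z/Z_1 (0 to 1); mu1 is the present gas fraction (0 < mu1 < 1).
    """
    if not 0.0 < mu1 < 1.0:
        raise ValueError("mu1 must lie strictly between 0 and 1")
    if not 0.0 <= z_ratio <= 1.0:
        raise ValueError("z_ratio must lie between 0 and 1")
    return (1.0 - mu1 ** z_ratio) / (1.0 - mu1)


if __name__ == "__main__":
    # The predicted metal-poor fraction changes only slowly with mu1,
    # as stated in the text for the likely range of gas fractions.
    for mu1 in (0.05, 0.1, 0.2):
        row = [round(simple_model_cdf(r, mu1), 3) for r in (0.1, 0.25, 0.5)]
        print(f"mu1 = {mu1}: S at Z/Z_1 = 0.1, 0.25, 0.5 -> {row}")
```

With $\mu_1 = 0.05$ the simple model places more than half of all stars below $Z_1/4$, which is the excess of metal-poor stars described above.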

Attempts to solve the G-dwarf problem have motivated 
numerous alternative models for the Solar neighborhood, in 
which one or more of the four straightforward assumptions 
are dropped. The following outline considers each assumption 
in turn, and some types of model will be developed from a 
different viewpoint in Section 4.3. Further details and refer- 
ences, on these and other models, can be found in reviews by 
Audouze & Tinsley (1976), Pagel & Patchett (1975), Tinsley 
(1974) and Trimble (1975). 


4.2.1 Infall 


It is unlikely that the region called the “Solar neighborhood” 
has evolved as a closed system. In particular, the disk could 
have grown gradually by accretion, on a timescale similar to 
that for star formation, as proposed by Larson (1972b) and 
Sciama (1972). The effect on the stellar metallicity distribu- 
tion is then as follows (Larson 1972a). Suppose the disk con- 
sists initially of unenriched gas, as in the simple case, but with 
a very small mass (surface density). The first stars formed are 
metal-poor, but their numbers are proportional only to the 
initial disk mass; star formation substantially enriches the gas 
while the disk is still far below its final mass. Then further 
star formation continually enriches the new gas that is ac- 
creted, so ultimately the only very metal-poor stars are the 
few initial ones. 

The model in Section 3.2.2 illustrates this behavior analytically.
According to Equation (3.28), Z remains essentially constant at
the yield value after the disk mass has increased to several times
its initial value. The stellar metallicity distribution, $S(Z)$,
can be derived analytically for this model (Tinsley 1975b), and
the result is a function strongly peaked at $Z = y$, as shown in
Figure 8. Here we have a problem opposite to that of the simple
model: there are too few stars in the model with $Z < Z_1$!
The problem is alleviated if the width
of the observed distribution is due more to errors than has 
been thought, but in any case this particular model is very 
schematic, showing effects of infall in a most extreme way, 
and more realistic models with infall (Section 4.3) predict 
metallicity distributions intermediate between the extreme 
and the simple cases. 


4.2.2 Pre-enrichment of the disk gas 


If the gas that formed the disk had a significant initial metal 
abundance, the number of more metal-poor disk stars would 
obviously be zero. This idea was introduced by Truran & 
Cameron (1971), who postulated a pregalactic burst of mas- 
sive stars. Ostriker & Thuan (1975) considered the effects of 
massive stars in the young halo, whose ejecta enriched the 
gas that was condensing into a disk; the likely importance of 
this effect will be reviewed below (Section 4.3.1). 

The effects of pre-enrichment can be seen analytically by 
simply adding a finite initial metal abundance $Z_0$ to the
solution of Equation (3.22). Then (3.23) becomes
$Z = Z_0 + y \ln \mu^{-1}$, and the exponent $Z/Z_1$ in (4.3) is
replaced by $(Z - Z_0)/(Z_1 - Z_0)$. Figure 8 shows the resulting
stellar metallicity distribution in the case $Z_0 = 0.2 Z_1$, and
there is clearly no excess of metal-poor stars. A more realistic
picture will be discussed in Section 4.3.
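The substitution described above can be sketched numerically. This is an illustration only; the function name `preenriched_cdf` is hypothetical, and the choice to return zero below $Z_0$ simply encodes the statement that no disk stars form with less than the initial metallicity.

```python
def preenriched_cdf(z_ratio: float, z0_ratio: float, mu1: float) -> float:
    """Simple-model S(Z) with a finite initial metallicity Z_0.

    z_ratio = Z/Z_1, z0_ratio = Z_0/Z_1 (0 <= z0_ratio < 1), and mu1 is the
    present gas fraction. The exponent Z/Z_1 of Equation (4.3) is replaced
    by (Z - Z_0)/(Z_1 - Z_0).
    """
    if not 0.0 < mu1 < 1.0:
        raise ValueError("mu1 must lie strictly between 0 and 1")
    if not 0.0 <= z0_ratio < 1.0:
        raise ValueError("z0_ratio must lie in [0, 1)")
    if z_ratio <= z0_ratio:
        return 0.0  # no stars are more metal-poor than the initial gas
    exponent = (z_ratio - z0_ratio) / (1.0 - z0_ratio)
    return (1.0 - mu1 ** exponent) / (1.0 - mu1)


if __name__ == "__main__":
    # Case shown in Figure 8: Z_0 = 0.2 Z_1, with mu1 = 0.05.
    for r in (0.1, 0.2, 0.4, 0.7, 1.0):
        print(f"Z/Z_1 = {r}: S = {preenriched_cdf(r, 0.2, 0.05):.3f}")
```

Setting `z0_ratio = 0` recovers the closed-box result of Equation (4.3).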


4.2.3 Variable IMF


The original solution to the G-dwarf problem, proposed by 
Schmidt (1963), was to postulate an IMF richer in massive 
stars at early times; the number of long-lived stars formed 
before the gas reaches a given metallicity is therefore re- 
duced. (An analytical solution illustrating this effect is given 
by Pagel & Patchett 1975). Many alternative models with 
variable IMFs have been considered subsequently, but they 
all suffer from being ad hoc: although arguments can be of- 
fered for massive stars forming more easily at low metallicity, 
they are not compelling, and there is no independent way of 
verifying suggested changes in the IMF in the distant past. 


4.2.4 Metal-enhanced star formation 


Talbot & Arnett (1973) suggested that there are always large 
chemical inhomogeneities in the ISM, and that stars form 
preferentially in regions with above-average metallicity. Tal- 
bot (1974) has developed this idea into a physical model, in 
which the enhanced cooling by heavy elements favors star 
formation in metal-rich regions. For this process alone to 
solve the G-dwarf problem, it would require both very large 
chemical inhomogeneities and very inefficient star formation 
in metal-poor regions; more plausibly, metal-enhanced star 
formation leads only to some reduction in the proportion of 
metal-poor stars. 


In summary, none of the four assumptions underlying the
simple model is likely to be strictly true, and departures from
any or all of them could alter the predicted metallicity
distribution in the required sense of including fewer metal-poor
stars.


4.3 Effects of Galaxy Formation 


In retrospect, the failure of the simple model is not surprising, 
because the Solar neighborhood was treated as a closed box, 
whereas gas flows during the formation of the Galaxy would 
affect chemical evolution strongly in two ways: 


(i) some pre-enrichment of the initial disk gas by massive 
stars in the young halo, and 


(ii) later accretion of metal-poor gas. 


The “formation” of the Galaxy could in the latter sense be 
continuing today. 


4.3.1 Metals from the young halo 


The amount of initial enrichment by massive halo stars can
be estimated from the present mass of long-lived stars in the
halo. Let $y_h$ be the yield during the time of halo formation.
Then, by definition of the yield, a mass $M_h$ of surviving halo
stars implies that a mass $\sim y_h M_h$ of metals was made and
ejected at an early time; of these metals, a mass $Z_h M_h$
remains locked in halo stars (where $Z_h$ is their mean
metallicity), and the remaining $(y_h - Z_h) M_h$ either fell to
the disk or was lost from the Galaxy. Ostriker & Thuan (1975)
suggest on geometrical grounds that about half of those metals
were retained by the disk. Now the halo yield, $y_h$, may differ
from the disk yield because the IMF for low-mass stars appears to
differ between the two regions (Section 2.2.2), but for a very
rough estimate we can consider $y_h \sim Z_\odot \gg Z_h$. The
mass of metals falling on the disk is therefore
$\sim 0.5\, Z_\odot M_h$, and the initial metallicity of the disk
gas is this quantity divided by the initial disk mass.
The appropriate value of $M_h$ is not the mass of halo stars
in a local cylinder, but the mass (mostly further out) from
which ejected gas would fall to this part of the disk; since the
surface density of halo stars is locally only a few percent of the
disk value, and it decreases with increasing radius (Schmidt
1975), $M_h$ is probably only a small fraction of the present
mass of the disk in the local cylinder. However, the initial disk
mass may also have been small. Altogether, it is possible that
metals captured from the young halo could have been enough
to have enriched the initial disk gas significantly. A value of
$Z_0/Z_\odot \sim 0.2$ would completely solve the G-dwarf problem,
as illustrated above.
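The rough estimate above can be written as a single relation. In this sketch $M_{d0}$ is a symbol introduced here for the initial disk mass (it is not used in the original text), the factor 0.5 is the retained fraction suggested by Ostriker & Thuan (1975), and the target value is read as $Z_0/Z_\odot \sim 0.2$:

```latex
% Initial disk metallicity from captured halo ejecta, assuming y_h ~ Z_sun >> Z_h.
\[
  Z_0 \;\approx\; \frac{0.5\, Z_\odot M_h}{M_{d0}}
  \qquad\Longrightarrow\qquad
  \frac{Z_0}{Z_\odot} \;\approx\; 0.5\,\frac{M_h}{M_{d0}} .
\]
% A pre-enrichment of Z_0/Z_sun ~ 0.2 would therefore need
\[
  \frac{M_h}{M_{d0}} \;\sim\; 0.4 ,
\]
% i.e. a contributing halo mass of order 40% of the initial (not the present) disk mass.
```

Because $M_h$ is probably a small fraction of the present disk mass, this condition can be met only if the initial disk mass was itself a small fraction of the present one.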

Early enrichment by halo stars appears as an important ef- 
fect in the dynamical models for the formation of disk galaxies 
of Larson (1976c), whose chemical evolution has been stud- 
ied further by Tinsley & Larson (1978). At model radii corre- 
sponding to the Sun’s position in the Galaxy, Z reaches about 
a third of its final value before stars with disk kinematics be- 
gin to form, and the initial disk mass in these models is about 
10% of the final value. As a result, the fraction of local stars
with $Z < Z_1/3$ is no greater than observed.

It may be concluded that the disk in the Solar neighborhood
plausibly started with a small fraction of its present
mass, in the form of gas that had been significantly enriched
by massive stars in the young halo.


4.3.2 Later metal-poor infall


In the dynamical models just mentioned, the outer disk grows 
on a timescale of billions of years by accretion of metal- 
poor gas; star formation occurs on a similar timescale, so 
the metallicity tends to reach a roughly constant value near 
the yield, as in the schematic infall model of Section 3.2.2. 

A significant additional factor, first suggested by Lynden-Bell
(1975), is that the ratio of SFR to infall rate ($\psi/f$) is
unlikely to remain strictly constant in a realistic picture; its
value is more likely to increase as the surrounding gas is used
up, as indeed occurs in the dynamical models. An asymptotic
relation between $\psi/f$ and Z can be derived using the
Equations of Section 3.2 with the instantaneous recycling
approximation. Let

$$(1 - R)\psi = k f, \tag{4.4}$$

where k is a constant, so Equation (3.19) becomes

$$M_g \frac{dZ}{dt} = (ky + Z_f - Z) f. \tag{4.5}$$

According to this Equation, there is an asymptotic value of Z,

$$Z \rightarrow ky + Z_f, \tag{4.6}$$

so the metal abundance of the gas tends toward a value $ky$ if
$Z_f = 0$. Thus if $\psi/f$ increases, so does Z, and there is
steady enrichment rather than the constant Z obtained in
Section 3.2.2.
Examples of this behavior are seen in the numerical models of 
Tinsley & Larson (1978), in particular in the regions between 
~7 and 15 kpc which have general properties resembling the 
Solar neighborhood. The slow increase of Z leads to a less 
peaked stellar metallicity distribution than that in the “ex- 
treme” infall model shown in Figure 8. 
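The approach to the asymptotic value in Equation (4.6) can be illustrated with a simple forward-Euler integration of Equation (4.5). This is a sketch under stated assumptions, not a reproduction of the numerical models cited: the infall rate and k are held constant, the gas mass is taken to change as $dM_g/dt = f - (1 - R)\psi = (1 - k) f$, and the function name, units, and default values are arbitrary.

```python
def evolve_metallicity(k: float, y: float, z_f: float = 0.0, z_init: float = 0.0,
                       m_gas0: float = 1.0, infall_rate: float = 1.0,
                       t_end: float = 10.0, n_steps: int = 10000) -> float:
    """Integrate M_g dZ/dt = (k*y + Z_f - Z) f with constant f and k.

    Returns the gas metallicity Z at time t_end. Units are arbitrary but
    must be mutually consistent (mass, mass per time, time).
    """
    dt = t_end / n_steps
    z, m_gas = z_init, m_gas0
    for _ in range(n_steps):
        dz_dt = (k * y + z_f - z) * infall_rate / m_gas
        # Assumed gas budget: infall f adds gas, star formation locks up (1-R)*psi = k*f.
        dm_dt = (1.0 - k) * infall_rate
        z += dz_dt * dt
        m_gas += dm_dt * dt
        if m_gas <= 0.0:
            raise ValueError("gas exhausted before t_end; reduce k or t_end")
    return z


if __name__ == "__main__":
    y = 0.02
    for k in (0.5, 1.0):
        z = evolve_metallicity(k, y)
        print(f"k = {k}: Z(t_end)/(k*y) = {z / (k * y):.4f}")
```

For k = 1 the gas mass is constant and Z converges quickly to y; for k < 1 the gas mass grows and the approach to $ky$ is slower, which is one way to see why a varying $\psi/f$ gives steady rather than saturated enrichment.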


4.3.3 Timescales for chemical evolution 


The original evidence for infall of extragalactic gas was the 
explanation of the high-velocity clouds of neutral hydrogen 
(Oort 1970), which required an infall rate comparable to the 
SFR. Further evidence is given by the timescale for star for- 
mation to consume the ISM in the Solar neighborhood, as 
pointed out by Larson (1972b): the local surface density of
ISM is $\sim 8\, M_\odot\, \mathrm{pc}^{-2}$ while the SFR is
$\sim 10\, M_\odot\, \mathrm{pc}^{-2}\, \mathrm{Gyr}^{-1}$
(Section 2.3.1), so the ISM is being used up on a timescale

$$\tau_* = \frac{M_g}{(1 - R)\psi} \sim 1\ \mathrm{Gyr}. \tag{4.7}$$

If there is not a source of new gas, flowing in at about the
SFR, we must be living very near the end of active star
formation in this part of the disk. Various constraints related
to interstellar pressure balance and Galactic X-ray emission
allow an infall rate up to
$\sim 1\, M_\odot\, \mathrm{pc}^{-2}\, \mathrm{Gyr}^{-1}$ in the
Solar neighborhood (Cox & Smith 1976). This is enough to extend
the lifetime of the gas against total consumption by only ~10%,
but all of the above estimates are rather uncertain, and it is
possible that $f \sim \psi$ in the Solar neighborhood, so the gas
is continually replenished.

A related timescale is that for chemical enrichment, which
can be written

$$\tau_Z \equiv \frac{Z}{|dZ/dt|}
  = \frac{Z M_g}{\left| y(1 - R)\psi + (Z_f - Z) f \right|}, \tag{4.8}$$

using Equation (3.19). If infall were negligible, this Equation
would give $\tau_Z = \tau_* (Z/y) \approx \tau_* \sim 1$ Gyr; we
would then expect to see Z increasing on a timescale of 1 Gyr
now, which is not consistent with the data in Figure 7. However,
if gaseous inflow is occurring at a rate comparable to the SFR,
$\tau_Z$ could be much longer.
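The arithmetic behind these two timescales can be checked with a short sketch. The returned fraction R is not quoted in this section, so the value R = 0.2 used below is an assumption chosen to reproduce the ~1 Gyr figure; the function names are likewise illustrative.

```python
import math


def gas_consumption_timescale(m_gas: float, sfr: float, returned_fraction: float,
                              infall: float = 0.0) -> float:
    """Time (Gyr) to exhaust the gas at the present net consumption rate.

    m_gas in Msun/pc^2; sfr and infall in Msun/pc^2/Gyr. With infall = 0
    this is Equation (4.7).
    """
    net_rate = (1.0 - returned_fraction) * sfr - infall
    return math.inf if net_rate <= 0.0 else m_gas / net_rate


def enrichment_timescale(z: float, y: float, m_gas: float, sfr: float,
                         returned_fraction: float, infall: float = 0.0,
                         z_infall: float = 0.0) -> float:
    """Equation (4.8): Z / |dZ/dt| in Gyr, using Equation (3.19) for dZ/dt."""
    rate = y * (1.0 - returned_fraction) * sfr + (z_infall - z) * infall
    return math.inf if rate == 0.0 else z * m_gas / abs(rate)


if __name__ == "__main__":
    R = 0.2  # assumed returned fraction, not taken from this section
    print(gas_consumption_timescale(8.0, 10.0, R))              # no infall
    print(gas_consumption_timescale(8.0, 10.0, R, infall=1.0))  # modest infall
    print(enrichment_timescale(0.02, 0.02, 8.0, 10.0, R))       # Z = y, no infall
    print(enrichment_timescale(0.02, 0.02, 8.0, 10.0, R, infall=8.0))
```

With these inputs an infall of 1 in the same units lengthens the consumption time from 8/8 to 8/7 Gyr, of order the ~10% quoted, while metal-free infall at the full rate $(1 - R)\psi$ with $Z = y$ drives $dZ/dt$ to zero and the enrichment timescale becomes arbitrarily long.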

Other processes than infall have been suggested to account 
for the slow enrichment rate in the Solar neighborhood. For 
example, various authors beginning with Truran & Cameron 
(1971) have suggested a time-dependent IMF with a greater 
yield in the past; thus at present $Z > y$, so $\tau_Z > \tau_*$.

Whatever such additional processes may affect the evolu- 
tion and distribution of metal abundance, it seems almost 
inevitable that gas flows related to galaxy formation have a 
strong influence on chemical evolution. The dynamical mod- 
els discussed above were designed to reproduce the structural 
properties of galaxies (Larson 1976c), so it is significant that 
they predict both pre-enrichment of the disk gas and later 
metal-poor infall, which thus appear to be the most natural 
solutions to the G-dwarf and timescale problems of the Solar 
neighborhood. From these results, it is to be expected that 
the chemical evolution of other regions of galaxies is affected 
by gas flows; examples will be reviewed in Section 5. 


4.4 Relative Abundances of the Elements 


Relative abundances of elements heavier than helium provide 
information on both nucleosynthesis and galactic evolution, 
in a variety of ways that are discussed extensively in the liter- 
ature (see reviews by Audouze & Tinsley 1976, Pagel 1978a, 
and Trimble 1975). Here the emphasis will be on points rele- 
vant to galactic evolution. 


4.4.1 Primary elements from different stars 


Stars of different masses eject new primary elements in dif- 
ferent proportions, so if the ISM is not well mixed we expect 
to see some variations in relative abundances on the surfaces 
of unevolved stars. (Variations would arise similarly from any 
dependence of primary nucleosynthesis on the initial compo- 
sitions of stars, Section 2.4.5). Until recently, most reported 
variations were within the uncertainties of measurement and 
analysis, but some trends are now emerging. 

The relative abundances of carbon, oxygen, and iron (for
which the data are reviewed by Pagel 1978a) are an interesting
example. All unevolved stars with reliable carbon abundances
have [C/Fe] = 0 within the errors, i.e. a Solar abundance
ratio⁹, but oxygen is systematically overabundant in
metal-poor stars, a typical value being [O/Fe] = 0.5 in stars
with [Fe/H] < −1. Since these stars have halo kinematics, it
appears that the yield of oxygen was relatively high during
the early life of the Galaxy. Oxygen is the most abundant
element after H and He, so it would be more appropriate to
say that the yield of iron (and perhaps carbon) was relatively
low.


⁹ This statement is tentative, since very few metal-poor stars
have quantitative carbon abundances and some globular clusters
show preliminary evidence for carbon excesses and/or deficiencies
(McClure 1979).


One possible explanation is that the IMF was different,
favoring massive stars, which have a higher oxygen/carbon
ratio in their ejecta according to current stellar models (e.g. 
Arnett 1978). Another possibility is that significant fractions 
of iron and carbon are ejected by stars with such long lives 
that they die too late to enrich the halo stars. This idea 
agrees qualitatively with the mass dependence of stellar
nucleosynthesis outlined in Section 2.4: carbon stars (and
planetary nebulae) with initial masses from $\sim 1$–$8\, M_\odot$
indicate that carbon production takes place partly in very
long-lived stars; Type I supernovae, with iron-enriched envelopes,
show that iron is ejected at least partly by stars of
$\lesssim 6\, M_\odot$; but there is no evidence for oxygen
production outside the massive stars ($\gtrsim 10\, M_\odot$) that
are theoretically predicted to be its main source. Models showing
the effects of nucleosynthesis in stars with such a wide range of
lifetimes have been studied by Tinsley (1979). It is found that
systematic variations in relative abundances arise if star
formation occurs on timescales less than the lifetime of the least
massive star producing significant amounts of an element of
interest. For example, iron produced by stars of intermediate mass
($\sim 4$–$6\, M_\odot$) is underabundant relative to oxygen if
stars form on a timescale
$\sim 10^8$ yr. Even if the whole time for halo formation is several
billion years, as indicated by the age spread of globular clus- 
ters (Section 1.1), such variations will appear in halo stars 
if they were formed in a series of short bursts. A very inho- 
mogeneous protogalaxy, with stars forming in dense lumps, 
has been suggested on dynamical grounds (Section 1.2), so 
it is interesting to note how chemical abundances could be 
affected. 


4.4.2 Secondary elements 


Secondary elements are those synthesized from “seed” ele- 
ments heavier than helium that were already present in the 
star at birth; some examples have been mentioned in Sec- 
tion 2.4.5. The stellar yields of these elements increase with 
metallicity, so their abundances are expected to increase more 
rapidly (and to be relatively lower in metal-poor stars) than 
those of primary elements. As reviewed by Pagel (1978a), this 
prediction is borne out qualitatively by relative underabun- 
dances of nitrogen and barium in metal-poor stars, but the 
data are in quantitative disagreement with theoretical mod- 
els. 

The canonical prediction for secondary abundances is that
they increase as the square of primary abundances, e.g.
Ba/H ∝ (Fe/H)² or Ba/Fe ∝ Fe/H. This prediction arises, as
follows, from the conventional assumption that a star of given
mass produces and ejects an amount of secondary element in
direct proportion to the star’s initial primary seed abundance.
Let us consider a secondary element “s”, with primary seed Z,
and let $p_{sm}$ be the mass of new s ejected by a star of mass m
with initial metallicity $Z_\odot$; with another initial
metallicity Z, the amount of new s ejected is
$p_{sm}(Z/Z_\odot)$. Thus the expression like (3.8) for the rate
of ejection of new s is

$$\int_{m_t}^{\infty} m\, p_{sm}\, \frac{Z(t - \tau_m)}{Z_\odot}\,
  \psi(t - \tau_m)\, \phi(m)\, dm. \tag{4.9}$$

The instantaneous recycling approximation to this expression
can be written $y_s (Z/Z_\odot)(1 - R)\psi$, where Z and $\psi$
are current values. The equation analogous to (3.19) for the
abundance $X_s$ of s is therefore

$$M_g \frac{dX_s}{dt} = y_s \left( \frac{Z}{Z_\odot} \right)
  (1 - R)\psi + (X_{sf} - X_s) f. \tag{4.10}$$


Figure 9. Ratios of primary to secondary abundances versus
metallicity. Data points (and shaded area with many points) are
from sources cited by Tinsley (1979), and dashed lines correspond
to a simple scenario for secondary nucleosynthesis (Section 4.4.2).
The nitrogen data are all for dwarfs; barium data include giants,
but these do not give a different picture from the dwarfs alone,
so there is probably no significant contamination by the stars’
own nucleosynthesis.

In the simple model of Section 3.2.1, the equation analogous
to (3.22) is

$$M_g \frac{dX_s}{dM_g} = -y_s \frac{Z}{Z_\odot}. \tag{4.11}$$


Now Z is given by Equation (3.23), so (4.11) can be solved
for $X_s$, with the result

$$X_s = \frac{1}{2} \left( \frac{y_s}{y Z_\odot} \right) Z^2, \tag{4.12}$$

which is the canonical prediction $X_s \propto Z^2$ mentioned
above. Since Solar abundances are near the mean disk values, they
must also be approximately the yield values, so the prediction
is equivalent to $X_s/Z \propto Z$ with a zero-point near the
Solar values.

Stellar nitrogen and barium abundances are plotted in Figure 9,
where the canonical relation would correspond in each
case to the straight line at 45° through about the Solar values
(zeros in the logarithmic notation). It can be seen that
the nitrogen abundances scatter greatly about the predicted
relation, mainly exceeding the predicted value in metal-poor
stars; and barium abundances are almost all greater than
predicted, no matter where one places this zero-point among the
data for stars near Solar composition.
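The step from Equation (4.10) to the quadratic relation (4.12) can be filled in as follows. This is a sketch that assumes the closed-box gas budget $dM_g/dt = -(1 - R)\psi$ and takes the simple-model metallicity equation in the form $M_g\, dZ/dM_g = -y$, which is consistent with $Z = y \ln \mu^{-1}$ in Equation (3.23):

```latex
% Closed box: f = 0 in (4.10), and gas is consumed at the rate (1 - R) psi.
\[
  M_g \frac{dX_s}{dt} = y_s \frac{Z}{Z_\odot}(1 - R)\psi ,
  \qquad
  \frac{dM_g}{dt} = -(1 - R)\psi
  \;\;\Longrightarrow\;\;
  M_g \frac{dX_s}{dM_g} = -\,y_s \frac{Z}{Z_\odot} .
\]
% Divide by the corresponding metallicity equation, M_g dZ/dM_g = -y:
\[
  \frac{dX_s}{dZ} = \frac{y_s}{y}\,\frac{Z}{Z_\odot} .
\]
% Integrate from X_s = 0 at Z = 0:
\[
  X_s = \frac{1}{2}\left(\frac{y_s}{y Z_\odot}\right) Z^2 .
\]
```

The factor of one half arises only from the integration over Z, so the $Z^2$ dependence does not depend on the star formation history within this class of model.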


What is wrong? There is a logical flaw in making predictions
with the simple model because it is known to be untenable
on account of the stellar metallicity distributions
(Section 4.2); furthermore, instantaneous recycling may be a bad
approximation for secondary elements that are made in red
giants down to initial masses $\sim 1\, M_\odot$. Plausible modifications
to the above theory lead to no better agreement, however, as 
the following examples show (Tinsley 1979). 


(i) Numerical calculations allowing for delayed ejection by 
long-lived stars make the situation worse, because they pre- 
dict even smaller secondary abundances in the most metal- 
poor stars. 


(ii) Infall makes the situation even worse, because it predicts
a steeper rise than $X_s/Z \propto Z$ at the metal-rich end.


The latter result can be derived simply in the extreme infall
model of Section 3.2.2. Use of Equation (4.10), with $X_{sf} = 0$
and (3.28) for Z, leads to a result analogous to (3.28),


(4.13) 


This formula reduces to Equation (4.12) in the limit of low
abundances, but its asymptotic limit,
$X_s \rightarrow y_s y/Z_\odot$, is twice the value obtained by
extrapolating (4.12) to the asymptotic value $Z \rightarrow y$.
Thus a plot of $X_s/Z$ versus Z for this model would
have a slope of 45° at first, but would become steeper at the
top, obviously in disagreement with the data in Figure 9.
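The two limits quoted above can be checked with a short derivation. The following is a sketch under explicit assumptions, not a transcription of Equation (4.13): the gas mass is constant with $f = (1 - R)\psi$, the variable $\nu \equiv \int f\, dt / M_g$ is introduced here for convenience, and Equation (3.28) is assumed to take the form $Z = y(1 - e^{-\nu})$.

```latex
% Equation (4.10) with X_sf = 0 and (1 - R) psi = f, written in terms of nu:
\[
  \frac{dX_s}{d\nu} = \frac{y_s\, y}{Z_\odot}\left(1 - e^{-\nu}\right) - X_s ,
  \qquad X_s(0) = 0 .
\]
% Solution (verify by substitution):
\[
  X_s = \frac{y_s\, y}{Z_\odot}\left[\,1 - (1 + \nu)\, e^{-\nu}\right] .
\]
% Low-abundance limit, nu << 1, where Z ~ y nu:
\[
  X_s \simeq \frac{y_s\, y}{Z_\odot}\,\frac{\nu^2}{2}
      = \frac{1}{2}\left(\frac{y_s}{y Z_\odot}\right) Z^2 ,
\]
% which is Equation (4.12). Asymptotic limit, nu -> infinity, where Z -> y:
\[
  X_s \rightarrow \frac{y_s\, y}{Z_\odot} ,
\]
% twice the value of (4.12) evaluated at Z = y.
```

Under these assumptions $X_s$ keeps rising after Z has nearly saturated at y, which is the steepening at the metal-rich end described in the text.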

From these results, it appears that the problem in Figure 9 
is not due to a simplistic choice of model for chemical evo- 
lution, but to wrong assumptions about nucleosynthesis. For 
nitrogen, it has often been suggested that a primary source 
of nucleosynthesis could contribute significantly (e.g. Tru- 
ran & Cameron 1971), and this component of the nitrogen 
abundance would make the ratio N/Fe more nearly constant. 
A possible scenario for primary nitrogen production is that 
some carbon produced by helium burning in stars of interme- 
diate mass gets mixed into the hydrogen-burning shell region 
and there converted to nitrogen; the models in Iben (1976) il- 
lustrate this possibility. Nitrogen abundances in H II regions, 
in our own and other galaxies, show a scatter similar to the 
stellar data, and primary production has again been proposed 
as an explanation (e.g. Smith 1975; Edmunds & Pagel 1978). 

For barium there is no obvious solution to the discrepancy 
in Figure 9, since a primary source is unlikely in terms of 
nuclear physics. The calculations of stellar structure and nu- 
cleosynthesis used to predict barium production are so com- 
plicated that results for Solar initial metallicities cannot be 
extrapolated to others (Iben & Truran 1978); it is there- 
fore likely that the conventional assumption, that production 
scales directly with the initial Fe/H of a star of given mass, 
is wrong. 


Other aspects of secondary nucleosynthesis relevant to 
galactic evolution are discussed in the references cited at the 
beginning of Section 4.4 and in the proceedings of a meeting 
on CNO isotopes in astrophysics (Audouze 1977). 


4.4.3 Radioactive elements 


The field of nucleochronology (or cosmochronology or nucleo- 
cosmochronology) uses meteoritic abundances of the decay 
products of certain radioactive elements to derive information 
on the age distribution of elements in the early Solar System.
This information is relevant to the rates of star formation
and nucleosynthesis before the Sun formed, and to the age of 
the Galaxy. Nucleochronology has been reviewed recently by 
Fowler (1979), Schramm (1974), and Trimble (1975), and its 
relevance to galactic evolution is discussed further by Reeves 
(1972), Reeves & Johns (1976), and in other papers referenced 
below. Some general principles will be outlined here. 


Let us consider the abundance $X_i$ of a radioactive nuclide
with decay constant $\lambda_i$ and net yield $y_i$. The equations
of chemical evolution must be modified to allow for decay in
the ISM and in stellar envelopes; the equation analogous to
(3.7) is

$$\frac{d(X_i M_g)}{dt} = -X_i \psi + E_i + X_{if} f
  - \lambda_i X_i M_g, \tag{4.14}$$

where $E_i$ is given, analogously to Equation (3.9), by

$$E_i(t) = \int_{m_t}^{\infty} \left[ (m - w_m - m p_{im})\,
  X_i(t - \tau_m)\, e^{-\lambda_i \tau_m} + m p_{im} \right]
  \psi(t - \tau_m)\, \phi(m)\, dm. \tag{4.15}$$

The decay timescale, $\lambda_i^{-1}$, is of course independent
of the system’s timescales for star formation and infall, so
straightforward analytical solutions for $X_i$ are possible only
in special
cases, even if instantaneous recycling is used. In fact, the long- 
lived radioactive elements are useful just because their abun- 
dances are sensitive to evolutionary details (such as the past 
SFR and the age) that cancel in the instantaneous recycling 
approximation and hardly affect other abundance parameters 
in detailed models. 

The simplest situation would be if all the elements formed 
in an initial burst. Then their decay in the ISM and stel- 
lar envelopes would lead to abundances at time t given by 
$X_i(t) = X_{i0} \exp(-\lambda_i t)$, where $X_{i0}$ is the
initial abundance and is proportional to the amount of element i
made, i.e. to its yield $y_i$. In particular, at the time T when
the Solar System formed, two such elements would have an abundance
ratio

$$\frac{X_i(T)}{X_j(T)} = \frac{X_{i0}}{X_{j0}}\,
  e^{-(\lambda_i - \lambda_j) T}, \tag{4.16}$$

where $X_{i0}/X_{j0} = y_i/y_j$. The value of T could in this case
be found by substituting into Equation (4.16) the yield ratio
(given by nucleosynthesis theory) and the abundance ratio
in the early Solar System (obtained from the abundances of
decay products in meteorites). The age of the Galaxy, which
is simply T plus the age of the Sun, could then be determined.

Of course, the elements were probably synthesized continually
during the period T from first star formation in the
Galaxy until the Solar System condensed, so Equation (4.16)
is unrealistic. As first shown by Schramm & Wasserburg
(1970), one can define a useful age parameter, here denoted
$T_{ij}$, analogously to the solution of (4.16) for T:

$$T_{ij} \equiv \frac{1}{\lambda_i - \lambda_j}
  \ln \left[ \frac{y_i / y_j}{X_i(T) / X_j(T)} \right], \tag{4.17}$$

which can, in principle, be evaluated from meteoritic and
nuclear data independently of Galactic evolution. In the limit of
long-lived elements ($\lambda_i T \ll 1$, $\lambda_j T \ll 1$),
$T_{ij}$ is just the mean age of elements in the gas, $T_Z$, at
the time when the Solar System formed (Tinsley 1975c).
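Equations (4.16) and (4.17) can be exercised with a small round-trip calculation. The sketch below is illustrative only: the function names are hypothetical, the pair of nuclides is taken to be uranium-235 and uranium-238 with half-lives of 0.704 and 4.468 Gyr, and the production ratio of 1.5 is a placeholder rather than a value from nucleosynthesis theory.

```python
import math


def decay_constant(half_life_gyr: float) -> float:
    """Decay constant lambda (per Gyr) from a half-life in Gyr."""
    return math.log(2.0) / half_life_gyr


def ratio_after_single_burst(yield_ratio: float, lam_i: float, lam_j: float,
                             elapsed_gyr: float) -> float:
    """Equation (4.16): X_i/X_j a time T after a single burst of synthesis."""
    return yield_ratio * math.exp(-(lam_i - lam_j) * elapsed_gyr)


def age_parameter(yield_ratio: float, abundance_ratio: float,
                  lam_i: float, lam_j: float) -> float:
    """Equation (4.17): the age parameter T_ij in Gyr."""
    return math.log(yield_ratio / abundance_ratio) / (lam_i - lam_j)


if __name__ == "__main__":
    lam_235 = decay_constant(0.704)
    lam_238 = decay_constant(4.468)
    production_ratio = 1.5  # placeholder value, for illustration only
    observed = ratio_after_single_burst(production_ratio, lam_235, lam_238, 6.0)
    # In the single-burst case T_ij recovers the elapsed time exactly.
    print(age_parameter(production_ratio, observed, lam_235, lam_238))
```

The round trip recovers the elapsed time only because the abundance ratio was generated from a single burst; for continuous synthesis the same function returns $T_{ij}$, whose relation to T is model-dependent.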

The relation between the mean age ($T_Z$) and the elapsed
time (T) is model-dependent, so estimates of $T_{ij}$ for
long-lived pairs of elements do not give T directly. Different
possibilities include the following.


(i) If essentially all nucleosynthesis of the relevant elements
took place in an initial burst, then $T = T_Z$.


(ii) The simple model for chemical evolution gives the
intuitive result that the mean age is half the elapsed time,
$T_Z = T/2$; however, because this model is discrepant with
stellar metallicities, one cannot assume that T is given simply by
$2T_Z$ in reality.


(iii) In extreme infall models, the ISM and heavy elements
in it have a mean age $\sim M_g/f$ at all times greater than
$M_g/f$; so if chemical evolution in the disk was strongly
affected by infall before the Solar System formed, the value of
$T_Z$ obtained from meteoritic abundances may reflect only the
timescale for infall, independently of the age T.


(iv) There are consistent models for the Solar neighborhood,
with some infall and/or some early enrichment, that
have values of $T_Z \sim T/2$ (as emphasized by Hainebach &
Schramm 1977), but since not all plausible models have this
relation it cannot be used confidently (as emphasized by
Tinsley 1977c).


In summary, there is a large uncertainty in any age estimate 
of the Galaxy derived from nucleochronology, except of course 
that the age of the Solar System is a reliable lower limit! 


The initial Solar System abundances of short-lived radioac- 
tive elements are sensitive to rates of nucleosynthesis at times 
immediately preceding the solidification of the meteorites. 
Their abundances suggest that the nucleosynthesis of most
elements ceased $\sim 10^8$ yr before the solidification time, but
some material was produced only $\sim 10^6$ yr earlier.
Interpretations of these timescales include passage of the
pre-Solar material through spiral arms at $10^8$-yr intervals;
enrichment by fresh supernova ejecta each $10^6$ yr, which could
result from
the average supernova rate in the Solar neighborhood; a last- 
minute supernova that triggered the formation of the Solar 
System; and locking of radioactive elements, with their decay 
products, into grains long before the Solar System formed. 
These possibilities are reviewed briefly by Podosek (1978), 
and discussed in detail by several authors in a conference 
proceedings edited by Gehrels (1978). As yet there is no con- 
sensus on the interpretation of short-lived radioactivities in 
the early Solar System, but ultimately they should provide 
valuable information on star formation and interstellar
chemistry.


For other galaxies, and for regions of our own outside the Solar
neighborhood, there is much less detailed information on
abundance distributions, but there are some striking trends 
that call for explanations involving the formation and later 
dynamical evolution of galaxies. A few relevant properties 
have been mentioned in Section 1.1, and now further de- 
tails will be described and some of the theoretical models 
reviewed. Other general reviews of this subject include those 
by Audouze & Tinsley (1976), Pagel (1978b), and Trimble 
(1975).


5.1 Outline of Relevant Data


Abundances are very often found to decrease outward in 
galaxies: gradients have been observed in the H II regions
of disks, in disk stars, and in the stars of spheroidal systems 
including elliptical galaxies and the bulge-halo components 
of spirals. 


In the Galactic disk, H II regions within a few kpc of the
Sun have an average gradient $d[\mathrm{O/H}]/dr \sim -0.1\ \mathrm{kpc}^{-1}$ (where
$r$ is the galactocentric distance), while stars of intermediate
age have a gradient $d[\mathrm{Fe/H}]/dr \approx -0.05\ \mathrm{kpc}^{-1}$; an open cluster
about 10 kpc from the Sun in the anticenter direction, of age 
only < 10° yr, is apparently as metal-poor as [Fe/H] < -1 
(Christian & Janes 1979). (Oxygen and iron abundances are 
quoted for the ISM and stars, respectively, because these are 
the best observed elements). The uncertainties are such that 
the apparent age dependence of the gradient may or may 
not be real. These data are reviewed by Janes (1977), Mayor 
(1977), and Peimbert (1977). H II regions in external galaxies
generally show gradients of a similar magnitude (e.g. Smith 
1975; Shields & Searle 1978; and references therein). However, 
only a marginal gradient appears in the Large Magellanic 
Cloud and none in the Small Cloud (Pagel et al. 1978). 
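
To give a feeling for the size of these gradients, a logarithmic gradient in dex per kpc can be converted into a linear abundance ratio over a chosen baseline. The short Python sketch below does this for the two gradients quoted above; the function name and the 5 kpc baseline are illustrative assumptions, not quantities taken from the cited papers.

```python
def abundance_ratio(gradient_dex_per_kpc: float, delta_r_kpc: float) -> float:
    """Linear abundance ratio between two radii separated by delta_r_kpc,
    for a logarithmic gradient expressed in dex per kpc."""
    return 10.0 ** (gradient_dex_per_kpc * delta_r_kpc)


# Gradients quoted in the text: about -0.1 dex/kpc for O/H in H II regions
# and about -0.05 dex/kpc for Fe/H in stars of intermediate age.
# The 5 kpc baseline is an arbitrary choice made for illustration only.
for label, gradient in (("O/H", -0.1), ("Fe/H", -0.05)):
    ratio = abundance_ratio(gradient, 5.0)
    print(f"{label}: abundance falls by a factor {1.0 / ratio:.2f} over 5 kpc")
```

A gradient of -0.1 dex per kpc thus corresponds to a full factor of ten in abundance over 10 kpc, if it were to persist over that range.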

The most obvious explanation for these gradients would be 
that the outer regions of disks are less chemically evolved than 
the inner regions, in the sense of having converted a smaller 
fraction of their gas to stars. The simple model, for example, 
would predict a $Z$ gradient given by $Z = y \ln \mu^{-1}$ arising from
a gradient in the gas fraction $\mu$ (Section 3.2.1). However, the
best studied galaxies (the Milky Way and M101) probably
do not have a sufficient gradient in $\mu$ for this explanation to
suffice. Gordon & Burton (1976) found that the combined
surface densities of atomic and molecular hydrogen lead to
a nearly constant gas fraction ($\mu \sim 0.05$) at $R > 4$ kpc in the
Galaxy, and Shields & Searle (1978) noted a similar problem
in M101. The amount of ISM interior to the Sun could
be overestimated, since the H$_2$ density is derived from observations
of CO molecules on the assumption of a constant
abundance ratio CO/H$_2$; if in fact CO/H$_2$ increases inward
because of a C/H abundance gradient, then there is less H$_2$
than had been thought at small radii (Peimbert 1977). With
this correction, the Galaxy probably has some gradient in $\mu$,
which could account in part for the interstellar composition
gradient. Of course, the simple model is known to be invalid
in the Solar neighborhood, so we do not expect the formula
$Z = y \ln \mu^{-1}$ to explain gradients in detail. Other ways of
generating gradients will be mentioned in Section 5.5. 
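
The weakness of the gas-fraction explanation can be illustrated with the simple-model relation $Z = y \ln(1/\mu)$. The Python sketch below evaluates the abundance difference, in dex, between two radii with different gas fractions; the yield cancels in the ratio. The function names and the sample gas fractions (0.05 and 0.10) are assumptions chosen for illustration, not measured values.

```python
import math


def simple_model_z(mu: float, y: float) -> float:
    """Metallicity of the gas in the simple model, Z = y ln(1/mu),
    where mu is the gas fraction (0 < mu <= 1) and y is the yield."""
    if not 0.0 < mu <= 1.0:
        raise ValueError("gas fraction must satisfy 0 < mu <= 1")
    return y * math.log(1.0 / mu)


def log_abundance_difference(mu_inner: float, mu_outer: float) -> float:
    """Difference log10(Z_inner / Z_outer) in dex; the yield cancels.
    Both gas fractions must be strictly less than 1."""
    return math.log10(simple_model_z(mu_inner, 1.0) / simple_model_z(mu_outer, 1.0))


# Hypothetical example: the gas fraction doubles from 0.05 to 0.10 outward.
print(f"{log_abundance_difference(0.05, 0.10):.2f} dex")
```

Because $Z$ depends only logarithmically on $\mu$, even a factor of two in gas fraction changes the predicted abundance by little more than a tenth of a dex in this example, which is small compared with a gradient of order 0.1 dex per kpc sustained over several kpc.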

The Galactic halo stars also have a metallicity gradient. 
Studies of individual stars in globular clusters show that a 
spread of metallicities of [Fe/H] $\sim -2$ to 0 occurs in the innermost
5 kpc (measured from the Galactic center, at any angle
to the disk), while the upper limit drops to $\sim -1$ at greater
radii (Cowley et al. 1978; Searle & Zinn 1978). It is not clear 
whether a systematic decline in iron and/or CNO abundances 
persists further out (McClure 1979; Kraft 1978). 


Figure 10. A color-magnitude diagram for elliptical and S0 galaxies
in several clusters of galaxies (Visvanathan & Sandage 1977). The
color $(u - V)$ is corrected for reddening, redshift, and aperture
effects; magnitudes are adjusted to the distance of the Virgo cluster
using redshift ratios. (Crosses denote Virgo galaxies not used by
Visvanathan & Sandage 1977 in their linear fit to the data.) The
straight lines are a linear fit, with $\pm 2\sigma$ boundaries. Despite the
excellent linear fit over a large range of magnitudes, the brightest
few points for the Virgo cluster alone (filled circles) and for other
clusters (open circles) show a tendency to level off in color.


Many elliptical and S0 galaxies have gradients of integrated
colors and line strengths in their spectra that are best interpreted
as composition gradients. The same quantities also
vary systematically with the absolute magnitudes of E and
S0 galaxies, as illustrated in Figure 10, indicating that the
brighter galaxies are more metal-rich. A thorough review of
this subject is given by Faber (1977). The analysis and calibration
of abundance effects in the integrated light of a galaxy
are much more complicated than for individual stars, because 
line strengths are strongly affected by the mixture of stellar 
temperatures and luminosities, and because the whole popu- 
lation in the HR diagram is shifted by effects of metallicity on 
the interiors and atmospheres of stars (Section 2.4.5). Until 
recently, it was not clear whether all elements heavier than 
helium enter the composition trends in E and S0 galaxies, or 
whether a few with strong spectral features (N, Mg, Na) are
mainly responsible. Cohen (1978) has now made a detailed 
observational study of some lines of Ca, Na, Mg, and Fe, to- 
gether with approximate theoretical predictions of how their 
strengths should vary with the composition of a population 
of stars; she finds no evidence against the abundances of all 
of these elements varying in the same proportions. 

The color-magnitude relation for elliptical galaxies is linear 
over a wide range of metallicities (Figure 10), which suggests 
a power-law relation between metallicity and luminosity. A 
tentative calibration, subject to revision when both the data 
and the theoretical color—metallicity relation are more secure, 
is 


$$Z_s \propto L_B^{0.3} \tag{5.1}$$


(Tinsley 1978b), where $Z_s$ is the average metallicity of stars
in an elliptical galaxy of blue luminosity $L_B$. This relation was
derived from population models differing only in metallicity,
so that the stars in all cases had the same IMF and age (as
outlined in Section 6.1.2). Such models also predict that the
mass-to-luminosity ratio should depend on metallicity, yielding
a relation $M_s/L_B \propto L_B^{0.13}$. A relation very similar to this has
been obtained observationally by Schechter & Gunn (1979)
for the cores of elliptical galaxies. Equation (5.1) therefore
corresponds to a tentative metallicity-mass relation,


$$Z_s \propto M_s^{0.25}, \tag{5.2}$$


where $M_s$ is the mass of stars (not any extended hidden halo
material), and the main uncertainties in the exponent are
due to the color-magnitude data and to the color-metallicity
calibration. There is some evidence that the color-magnitude 
relation levels off at the magnitudes of the brightest cluster 
galaxies, as can be seen, for example, from the brightest few 
points in Figure 10. 


Heavy elements have been detected in intergalactic gas, 
and since these elements almost certainly come from galactic 
stars they are relevant to the chemical evolution of galaxies. 
A feature due to iron has been observed in the diffuse X-ray 
emission spectra of several rich clusters of galaxies; the interpretation
is that these clusters contain hot gas ($\sim 10^8$ K),
emitting by thermal Bremsstrahlung, with approximately So- 
lar abundances of iron. The mass of intergalactic gas inferred 
from the data is model-dependent (e.g. Bahcall & Sarazin 
1977; Perrenod 1978), and is roughly between 1 and 30 times 
the total luminosity of cluster galaxies, in solar units. Now the 
mass of stars in galaxies is ~10 times their total luminosity, in 
solar units,¹⁰ and their average metallicity is approximately
Solar, so the intergalactic medium (IGM) in the rich clusters 
apparently contains about the same total mass of iron as do 
the stars themselves. These observations suggest that galax- 
ies sometimes lose enough metal-rich gas to affect their own 
chemical evolution substantially. 
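
The comparison made in this paragraph is simple arithmetic, and the Python sketch below spells it out. It takes the gas mass per unit luminosity (between 1 and 30 in solar units, as quoted above), the stellar mass per unit luminosity (about 10), and the assumption stated in the text that both gas and stars have roughly Solar abundances. The function name and the three sampled gas masses are illustrative choices.

```python
def igm_to_stellar_metal_ratio(gas_mass_per_lum: float,
                               stellar_mass_per_lum: float = 10.0,
                               z_gas_over_z_stars: float = 1.0) -> float:
    """Ratio of the metal mass in intergalactic gas to the metal mass in stars.
    Masses are per unit cluster luminosity, in solar units."""
    return gas_mass_per_lum * z_gas_over_z_stars / stellar_mass_per_lum


# Sample the quoted range of gas masses, 1 to 30 solar masses per solar luminosity.
for gas in (1.0, 10.0, 30.0):
    ratio = igm_to_stellar_metal_ratio(gas)
    print(f"gas mass {gas:4.0f} M_sun/L_sun -> IGM/stellar metal ratio {ratio:.1f}")
```

The resulting ratio spans roughly 0.1 to 3 and so brackets unity, which is the sense in which the IGM holds "about the same" mass of iron as the stars.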

Another striking observation of metal-rich IGM is ab- 
sorption due to Ca and Mg in the spectrum of a quasar 
(3C 232) lying 1.9′ on the sky from a foreground spiral galaxy
(NGC 3067); the absorption redshift matches that of the 
galaxy, and neutral hydrogen absorption has been detected 
at the same redshift in the radio spectrum of the quasar 
(Haschick & Burke 1975). The line strengths and positions of 
the objects imply that there is gas with roughly Solar abun- 
dances at least 17 kpc from the center of the galaxy (Boksen- 
berg & Sargent 1978). 
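
The link between the angular separation and the quoted linear scale follows from the small-angle relation. The Python sketch below assumes, as an interpretation not stated explicitly in the text, that the 17 kpc figure is the projected separation corresponding to 1.9 arcmin; under that assumption it returns the distance to the galaxy that the two numbers jointly imply. The function names are hypothetical.

```python
import math

ARCMIN_TO_RAD = math.pi / (180.0 * 60.0)


def projected_separation_kpc(angle_arcmin: float, distance_mpc: float) -> float:
    """Projected separation in kpc for a small angle at a given distance."""
    return angle_arcmin * ARCMIN_TO_RAD * distance_mpc * 1.0e3


def implied_distance_mpc(angle_arcmin: float, separation_kpc: float) -> float:
    """Distance in Mpc at which angle_arcmin subtends separation_kpc."""
    return separation_kpc / (angle_arcmin * ARCMIN_TO_RAD) / 1.0e3


# Assumption: 17 kpc is the projected separation at 1.9 arcmin.
distance = implied_distance_mpc(1.9, 17.0)
print(f"implied distance: {distance:.1f} Mpc")
```

Under this reading the adopted distance is about 31 Mpc. Since the true separation can only exceed the projected one, the gas lies "at least" this far from the center of the galaxy.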


5.2 Abundance Gradients in Spheroidal Systems 


Most stars in elliptical galaxies and in the bulge-halo compo- 
nents of spirals were probably formed within a few times 10° yr 
at a time $\sim 10^{10}$ yr ago. The abundance gradients in these systems
therefore reflect processes occurring during the time of
star formation in a protogalaxy; several such processes have 
been suggested as possible causes of gradients. 


5.2.1 Dissipative collapse 


The most extensive models exploring the effects of star for- 
mation during the collapse of an initially gaseous protogalaxy 
are those reviewed by Larson (1976a). In the spheroidal com- 
ponents of model disk galaxies, and in model ellipticals, star 
formation becomes increasingly concentrated toward the cen- 
ter as the density builds up. This effect is illustrated in Fig- 
ure 11 (a), which shows the SFR as a function of time at 
several radii in a spherical collapse model. Stars formed at a 
given radius remain in orbits with little net inward motion,
but the gas sinks further in because it is dissipative (i.e., its
kinetic energy of radial motion is partly lost via collisionally
induced radiation). Thus the metals ejected by evolving stars
are carried inward by the gas, and an abundance gradient
develops in the gas. As stars continue to form, their compositions
reflect this gaseous abundance gradient. Figure 11 (b)
shows the evolution of metallicity of newly formed stars (i.e.,
$Z$ of the gas) at several radii in a spherical model, and the
rapid development of a gradient is clear. The same process of
dissipation produces a central concentration in the gas density,
which leads to a condensed nucleus of stars.

¹⁰ This mass is not to be confused with the virial masses of clusters,
which are ~100 times the total luminosity and which provide the
main evidence for hidden non-stellar matter in association with
galaxies.

If there were no dissipation, the stars and gas would col- 
lapse together and the metals would not be concentrated 
inward. Thus in the outer parts of some of these models, 
where the protogalactic density is too low for dissipation to 
be effective, no stellar abundance gradient appears. The pos- 
sible lack of a gradient in metallicities of globular clusters 
beyond ~10 kpc from the Galactic center has therefore been 
interpreted as showing that the collapse of the Galaxy began 
with the stars and gas in free-fall; conversely, the gradient at 
smaller radii is interpreted as showing the effects of dissipa- 
tion at a later stage of the collapse (Larson 1977a). 


5.2.2 A gradient in the IMF 


Aside from any effects of gas flows, negative metallicity gradi- 
ents could be produced by gradients in the IMF that led to a 
yield decreasing outward. Since the yield (Equation 3.12) de- 
pends on the relative numbers of metal-producing stars, pos- 
sibilities would be a steeper slope for massive stars or more 
low-mass “inert” stars, at larger radii. In the latter case, the 
stars that survive to the present would still have a radial gra- 
dient in their mass function, with an interesting consequence: 
most of the luminosity of an old population of stars comes 
from dwarfs near the MS turnoff and evolving giants, while 
most of the mass is in less massive objects that contribute 
little light; thus the M/L ratio increases with the proportion 
of very low-mass stars, and the postulated gradient in the 
IMF would lead to an outward increase of M/L. Such a trend 
is indeed observed, although it is sometimes ascribed to an 
extended “halo” of non-stellar condensed objects that formed 
too soon to affect chemical evolution (Section 2.2.2). van den 
Bergh (1975) has suggested that the IMF tends to have more 
massive stars in regions of high density, and this view of the 
origin of metallicity gradients is part of his evidence. The 
hypothesis of a gradient in the IMF in spheroidal systems 
has no convincing theoretical basis, and the trends it would 
explain can arise in other ways, but nevertheless systematic 
variations in the IMF could be as important as they are hard 
to verify. 


5.2.3 Finite stellar lifetimes


The timescale for star formation in a protogalaxy could be 
comparable to the lifetimes of some metal-producing stars, in 
which case stars formed early would be relatively unenriched. 
Thus if the outermost stars formed before most of the central 
ones, there would be a negative metallicity gradient. In the 
models in Section 5.2.1 (Larson 1976a) it is assumed that 
all metals are produced by stars with lifetimes $< 3 \times 10^7$ yr,
so this effect is negligible; but iron production by Type I
supernovae could in fact be significant on longer timescales
(Section 2.4.4). What timescales are relevant? The minimal
collapse time for a protogalaxy is the free-fall time,


$$t_{\mathrm{ff}} = 1.7 \times 10^{6} \left(\frac{M}{10^{11}\,M_\odot}\right)^{-1/2} \left(\frac{R}{1\ \mathrm{kpc}}\right)^{3/2}\ \mathrm{yr}, \tag{5.3}$$


where $M$ and $R$ are the mass and radius. For example, a galaxy
with $M = 2 \times 10^{11}\,M_\odot$ and $R = 15$ kpc has $t_{\mathrm{ff}} = 7 \times 10^7$ yr; and a
protogalaxy of the same mass collapsing from $R = 50$ kpc has
$t_{\mathrm{ff}} = 4 \times 10^8$ yr. Much longer timescales for star formation are
possible if the dissipation is slow, and the collapse time of the
system can be much longer if its boundary initially expands
with the Universe (Gunn & Gott 1972). At least the outer
parts of large galaxies could therefore be metal-poor partly
because of the finite lifetimes of metal-producing stars.


Figure 11. (a) Star formation rates and (b) metallicities of newly formed stars (i.e., $Z$ of the gas), at several radii in a collapse model for
the formation of a spherical galaxy (Larson 1974a). The radius in pc is marked on each curve, and the three ticks indicate the times at
which star formation is 10%, 50%, and 90% complete (relative to the final mass of stars) at that radius.
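
The two free-fall times quoted with Equation (5.3) can be checked directly. The Python sketch below assumes the equation has the form $t_{\mathrm{ff}} = 1.7 \times 10^6\,(M/10^{11} M_\odot)^{-1/2} (R/1\,\mathrm{kpc})^{3/2}$ yr, which is the scaling that reproduces both worked examples in the text; the function name is an illustrative choice.

```python
def free_fall_time_yr(mass_msun: float, radius_kpc: float) -> float:
    """Free-fall time in years for a protogalaxy of the given mass (solar masses)
    and radius (kpc), using the scaling assumed for Equation (5.3)."""
    return 1.7e6 * (mass_msun / 1.0e11) ** -0.5 * radius_kpc ** 1.5


# The two cases discussed in the text, both with M = 2e11 solar masses.
for radius in (15.0, 50.0):
    t_ff = free_fall_time_yr(2.0e11, radius)
    print(f"R = {radius:4.0f} kpc -> t_ff = {t_ff:.1e} yr")
```

Both values are several times longer than the $3 \times 10^7$ yr lifetime assumed for metal-producing stars in the collapse models, but could be comparable to the lifetimes of lower-mass iron producers.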

A potential test is to look for variations in relative abun- 
dances. For example, if oxygen comes only from very massive 
stars but iron comes partly from stars of intermediate mass 
(Section 2.4; Section 4.4.1), then iron should be more de- 
ficient than oxygen in the outermost stars. The hypothesis 
of a gradient in the IMF of massive stars would predict the 
opposite trend in relative abundances. Current data do not 
detect any gradients in relative abundances, but oxygen itself 
has not been studied and nor have the faint outer regions of 
elliptical galaxies. 

It is quite possible that all of the processes discussed above
were effective in producing abundance gradients in spheroidal
systems, so clear choices among the theories are not to be
expected.


5.3 The Metallicity-Mass Relation for Elliptical Galaxies 


The correlation between metallicity and mass (color and lu- 
minosity) of elliptical galaxies has been explained in several 
ways, of which two will be reviewed here. These each involve 
dynamical effects during galaxy formation, resulting in less 
complete conversion of the protogalactic gas to stars, and so 
to a smaller final mean stellar metallicity, in smaller systems. 
One could, of course, invoke differences in the IMFs of galax- 
ies as a function of their mass, but there is no independent 
evidence for a trend of the required form. 


5.3.1 Supernova-driven winds 


Star formation and chemical enrichment are cut off in a pro- 
togalaxy if the residual gas is lost, and a possible loss mech- 
anism is a galactic wind energized by supernova explosions. 
Galactic winds were first analyzed for a steady-state case by 
Johnson & Axford (1971) and Mathews & Baker (1971); sim- 
ilar analyses have been made for nuclear bulges of spirals 
by Faber & Gallagher (1976) and for bulge-disk systems by 
Bregman (1978). A galaxy sustains a steady-state wind if the 
supernova rate divided by the rate of supply of gas (from 
evolving stars) gives the gas enough energy to escape from
the galactic potential well. For protogalaxies, we are interested
not in the steady state, but in conditions for the initiation
of a wind that can remove essentially all of the residual
gas. Larson (1974b) discussed possible effects of supernovae 
in heating the gas, and adopted a simple approximation as 
the condition for its removal: the residual gas is assumed to be 
lost suddenly when the total heat input from supernovae has 
provided the gas with the escape energy, assuming uniform 
conditions throughout the protogalaxy. This approximation 
is plausible enough to suggest how the loss condition scales 
with the relevant parameters, but there are unavoidably large 
uncertainties in the astrophysics involved so the results are 
not very secure. The scaling can be derived as follows. 


Let E be the thermal energy imparted to the ISM by super- 
novae when a unit mass of stars is formed; E is proportional 
to the fraction of stars that become supernovae, to the mean 
kinetic energy of material ejected in a supernova explosion, 
and to the efficiency with which this energy is transferred to 
the ISM as heat. (The last factor is the most uncertain). As 
an approximation, let E be treated as a constant, despite fi- 
nite stellar lifetimes and complicated effects of the clumpiness 
of the ISM, its chemical composition, etc. Let us consider a 
spherical protogalaxy of mass $M$ that has formed a mass $M_s$
of stars and has residual gas mass $M_g = M - M_s$. The condition
for gas to escape can be written


Potential energy of gas = Energy from supernovae,

i.e.,

$$K \frac{G M M_g}{R} = E M_s, \tag{5.4}$$


where $K$ depends on the density distribution in the galaxy and
will be assumed constant as another simplification. Large elliptical
galaxies are observed to be more tightly bound than
small ones, so a greater fraction of their gas must be converted
to stars before the condition (5.4) is satisfied; therefore,
their surviving stars have a greater mean metallicity.
Other consequences of this scenario are that the more massive
galaxies collapse more extensively before star formation
is cut off, so they are predicted to have more condensed nuclei
and steeper metallicity gradients than smaller galaxies
(Section 5.2.1). These trends are observed, lending support
to this type of origin for the increase of metallicity with mass.

The form of the metallicity-mass relation can be accounted
for using the same approximate model. Let the initial mass-radius
relation for protogalaxies have the form


$$M \propto R^{\alpha}, \tag{5.5}$$

so Equation (5.4) can be written

$$M^{1 - 1/\alpha} (M - M_s) \propto M_s. \tag{5.6}$$


Asymptotic equations for the mean metallicity of stars can be
derived from very general considerations: the mass of metals
synthesized and ejected is $yM_s$, so at early stages of evolution
when $M \approx M_g$, we have approximately


$$Z_s \approx \frac{y M_s}{M_g} \approx \frac{y M_s}{M}, \quad (Z_s \ll y). \tag{5.7}$$

At late stages, Equation (3.21) predicts that in all cases where
mass is conserved,

$$Z_s \to y, \quad \text{as } M_s \to M. \tag{5.8}$$

The results from numerical collapse models verify these relations
in cases of interest here. Substituting Equation (5.7)
into (5.6), we find the stellar metallicity-mass relation,


$$Z_s \propto M_s^{(\alpha - 1)/(2\alpha - 1)}, \quad (Z_s \ll y). \tag{5.9}$$


The tentative empirical relation (5.2) is obtained if $\alpha = 1.5$,
which agrees fairly well with the observed mass-radius relation
for elliptical galaxies if one considers how the stellar
system must swell (to conserve energy) when the gas is lost
(Tinsley 1978b). According to Equation (5.8), the power-law
relation between $Z_s$ and $M_s$ must level off at large masses,
with $Z_s \to y$ in the limit when essentially all the original
material is converted to stars; this behavior agrees with the
levelling of the color-magnitude relation at the magnitudes
of the brightest cluster galaxies.
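
The power-law slope and its flattening can be checked numerically. The Python sketch below solves the gas-loss condition in the form $M^{1-1/\alpha}(M - M_s) = c\,M_s$, where the explicit constant $c$ (which absorbs $E$, $K$, and $G$) and the arbitrary mass units are assumptions introduced here. It then measures the local logarithmic slope of $Z_s \approx y M_s/M$ against $M_s$. That form of $Z_s$ is strictly the early-stage approximation, so the high-mass behavior is only indicative.

```python
import math


def stellar_mass_at_gas_loss(total_mass: float, alpha: float, c: float) -> float:
    """Solve M**(1 - 1/alpha) * (M - M_s) = c * M_s for M_s.
    The constant c is a hypothetical parameter in arbitrary units."""
    x = total_mass ** (1.0 - 1.0 / alpha)
    return total_mass * x / (c + x)


def local_slope(total_mass: float, alpha: float, c: float, step: float = 1.01) -> float:
    """Numerical d ln Z_s / d ln M_s using Z_s ~ y M_s / M (the yield cancels)."""
    m1, m2 = total_mass, total_mass * step
    s1 = stellar_mass_at_gas_loss(m1, alpha, c)
    s2 = stellar_mass_at_gas_loss(m2, alpha, c)
    return math.log((s2 / m2) / (s1 / m1)) / math.log(s2 / s1)


def analytic_slope(alpha: float) -> float:
    """Low-mass exponent (alpha - 1) / (2 alpha - 1)."""
    return (alpha - 1.0) / (2.0 * alpha - 1.0)


print("analytic exponent for alpha = 1.5:", analytic_slope(1.5))
print("numerical slope, low-mass system :", round(local_slope(1.0e-6, 1.5, 1.0), 3))
print("numerical slope, high-mass system:", round(local_slope(1.0e6, 1.5, 1.0), 3))
```

For small systems the numerical slope approaches the analytic exponent, while for large systems it tends to zero as nearly all the gas is converted to stars.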

The critical parameter E can plausibly have a value that 
would give the right scale for the $Z_s$–$M_s$ relation (Larson
1974b), but its value is very uncertain so the success of this 
theory must be considered tentative. The interaction between 
supernovae and the ISM could, in fact, be so weak as to drive 
a wind in only the very smallest protogalaxies. 


5.3.2 Bursts of star formation in merging subsystems 


Since the largest elliptical galaxies are the most metal-rich, a 
natural hypothesis is that chemical enrichment accompanied 
the growth of galaxies by successive mergers among small sub- 
systems. As noted in Section 1.2, gaseous protogalaxies prob- 
ably consisted of many dense lumps, so it is only a change 
of viewpoint to consider these as merging subsystems rather 
than as a collapsing unit. Moreover, extrapolations backward 
in time from the incidence of strongly interacting galaxies in 
sight today suggest that collisions and coalescence were com- 
mon processes in the early lives of galaxies (Toomre 1977; 
Vorontsov-Velyaminov 1977). A property of colliding galaxies 
most relevant to chemical evolution is that they often appear 
to be undergoing intense bursts of star formation induced by 
the dynamical disturbance (Section 7.2), so it is reasonable 
to assume that star formation was caused in the past by coa- 
lescence of subsystems in a protogalaxy. A qualitative model 
of chemical enrichment by this process has been proposed by 
Tinsley & Larson (1979): elliptical galaxies form by a hierar- 
chical sequence of mergers among subsystems, starting from 
small unenriched gas clouds; a burst of star formation occurs 
at each merger, so at each stage of growth the fraction of 
the total mass in stars increases and the mean metallicities 
of stars and gas increase. In this picture, the final mass of an 
elliptical galaxy is determined by the total mass of the small 
pieces initially present in its surroundings. When these have 
all been mopped up, efficient star formation stops. Any resid- 
ual gas may get swept away if the system is moving through 
an ambient IGM, or possibly blown out in a wind; if it re- 
mains bound to the system, it could settle to a disk and leave 
the “elliptical galaxy” as the central bulge of a spiral. 

The resulting $Z_s$–$M_s$ relation depends on the efficiency of
star formation as a function of the mass of the system (i.e., the
system that has been built after a given number of mergers),
where efficiency is defined as the mass of stars formed (in a
given burst) per unit mass of gas. An approximately power-law
relation between $Z_s$ and $M_s$ can be obtained only if the
efficiency increases with the total mass of the system, i.e.,
with successive mergers. For example, a relation


$$\text{Efficiency of star formation} \propto (\text{Total mass})^{p}, \tag{5.10}$$

where $p$ is a constant, leads to

$$Z_s \propto M_s^{p/(1+p)}, \quad (Z_s \ll y), \tag{5.11}$$


with the usual limit $Z_s \to y$ when all the gas is consumed.
The relation (5.10) can be justified qualitatively by considerations
of gas compression during collisions and mergers of subsystems.
To reproduce the tentative empirical relation (5.2),
Equation (5.11) needs $p = 1/3$, which is consistent with the
compression arguments. Equation (5.11) results from (5.10)
independently of such details as the mass distribution of
merging pieces, and it can be understood as follows: Equation (5.7)
is true in any models with mass conservation (including
here conservation of the total mass of merging pieces),
while Equation (5.10) gives, dimensionally,


$$\frac{M_s}{M_g} \propto M^{p},$$

so that

$$M_s \propto M^{1+p} \quad \text{when } M_g \approx M \ (M_s \ll M); \tag{5.12}$$


Equation (5.11) then follows from (5.7) and (5.12). The power
law is again predicted to level off, with $Z_s \to y$ at high masses,
according to Equation (5.8).
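
The step from the efficiency law to the metallicity-mass exponent is short enough to verify in a few lines. With $M_s \propto M^{1+p}$ and $Z_s \approx y M_s/M \propto M^{p}$, eliminating $M$ gives $Z_s \propto M_s^{p/(1+p)}$. The Python sketch below evaluates this exponent and its inverse; the function names are illustrative, and the target exponent 0.25 is the one implied by the statement that $p = 1/3$ reproduces the empirical relation.

```python
def merger_exponent(p: float) -> float:
    """Exponent q in Z_s ~ M_s**q when the star formation efficiency
    scales as (total mass)**p, valid while M_s << M."""
    return p / (1.0 + p)


def required_p(q: float) -> float:
    """Efficiency index p needed to give a metallicity-mass exponent q (q < 1)."""
    return q / (1.0 - q)


print("exponent for p = 1/3:", merger_exponent(1.0 / 3.0))
print("p needed for exponent 0.25:", required_p(0.25))
```

The exponent is always less than unity and rises slowly with $p$, so the metallicity-mass slope is not very sensitive to the assumed efficiency law.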

As a theory for the origin of the metallicity—mass relation, 
this model has the advantage of invoking processes that can 
be studied in nearby interacting galaxies, but it remains to 
be seen whether the structural properties of elliptical galaxies 
are fully consistent with its dynamical implications. 


5.3.3 Mergers of stellar systems


The color-magnitude (metallicity—mass) relation for ellipti- 
cal galaxies is apparently affected in a way that has nothing 
to do with chemical evolution: central cluster galaxies ac- 
crete their neighbors, by the process of dynamical friction. 
There is no star formation during these mergers, because the 
galaxies involved are ellipticals or S0s with almost no gas.
Thus the growth in luminosity is not accompanied by chem- 
ical enrichment, and it can make the growing system bluer 
because the surrounding galaxies that it accretes are gener- 
ally smaller than the central giant. Galactic cannibalism by 
dynamical friction was first proposed by Ostriker & Tremaine 
(1975), and later papers (e.g. Hausman & Ostriker 1978, and 
references therein) have developed its implications for cos- 
mological tests, the origin of core—halo structure of cD galax- 
ies, the luminosity function of galaxies in clusters, and the 
color-magnitude relation itself. The process obviously tends 
to make the color-magnitude relation turn over toward bluer 
colors at the bright end. This effect has been proposed as a 
test for the occurrence of cannibalism in clusters, but the re- 
sults are not unambiguous because there is an intrinsic upper 
limit, $Z_s \to y$, to the average stellar metallicity in the models
discussed above, that leads to a flattening of the relation any- 
way. Strong evidence that galaxies in the centers of clusters 
do merge with each other is given by the lumpy appearance 
of the central supergiant (cD) members of some clusters; the 
lumps are interpreted as recently swallowed galaxies, and the 
timescale for them to merge into a smooth system is generally
< 10° yr (Gunn 1977).


5.4 The Intergalactic Medium and Gas Lost from Galaxies


Loss of interstellar gas from galaxies can both affect their 
own evolution, as discussed for example in Section 5.3 above, 
and be a significant source of metal-enriched IGM. 


5.4.1 Loss of metals from galaxies 


The mass of metals lost from an elliptical galaxy can be es- 
timated by the following argument, which is independent of 
the method of gas loss. The mass of metals ever made by stars 
in the galaxy is $\sim yM_s$ (by the definition of the yield, Equation 3.12),
and the mass of metals presently in its stars is
$Z_s M_s$, so the mass lost to the IGM at some stage is $\sim (y - Z_s) M_s$.
This reasoning was used by Larson & Dinerstein (1975) to 
predict a substantial metal-rich IGM in clusters of galaxies, 
and a number of models with similar results have been ad- 
vanced since the iron X-ray line was discovered. An essentially 
model-independent estimate can be made as follows. 

Let $\phi(M_s)$ be the mass function of elliptical galaxies in a
cluster. Then the total mass of metals they have supplied to
the IGM is

$$M_{Zi} = \int [y - Z_s(M_s)]\, M_s\, \phi(M_s)\, dM_s, \tag{5.13}$$

where $Z_s(M_s)$ is a function derivable from the color-magnitude
relation. In practice, $M_s$ is expressed in terms of luminosity,
and $\phi(M_s)$ is obtained from the luminosity function. The value
of $y$ should be taken as the maximum $Z_s$ of an elliptical galaxy,
which is hard to obtain since the extensive outer regions that
are probably metal-poor are seldom observed; setting $y$ equal
to the mean metallicity of local stars (a little under $Z_\odot$) is
equivalent if elliptical galaxies have the local IMF. In a calculation
equivalent to the one just outlined, Tinsley & Larson
(1979) found that a cluster of elliptical galaxies would contain
a mass $\sim (2\text{–}5)\, M_\odot Z_\odot$ of intergalactic metals per solar
luminosity.
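
Equation (5.13), read as $M_{Zi} = \int [y - Z_s(M_s)]\,M_s\,\phi(M_s)\,dM_s$, is straightforward to evaluate numerically once a mass function and a metallicity-mass relation are chosen. The Python sketch below is a minimal illustration only: the Schechter-like mass function, its parameters, the mass limits, and the mass at which $Z_s$ reaches $y$ are all hypothetical inputs, and the exponent 0.25 follows the tentative metallicity-mass relation. It does not reproduce the calculation of Tinsley & Larson (1979).

```python
import math


def trapezoid(f, a: float, b: float, n: int = 2000) -> float:
    """Trapezoidal rule for f on [a, b] using n logarithmically spaced intervals."""
    xs = [a * (b / a) ** (i / n) for i in range(n + 1)]
    total = 0.0
    for x0, x1 in zip(xs[:-1], xs[1:]):
        total += 0.5 * (f(x0) + f(x1)) * (x1 - x0)
    return total


def metals_lost(y, z_of_m, phi, m_lo, m_hi):
    """Integral of [y - Z_s(M_s)] M_s phi(M_s) dM_s, as in Equation (5.13)."""
    return trapezoid(lambda m: (y - z_of_m(m)) * m * phi(m), m_lo, m_hi)


def stellar_mass(phi, m_lo, m_hi):
    """Total stellar mass, the integral of M_s phi(M_s) dM_s."""
    return trapezoid(lambda m: m * phi(m), m_lo, m_hi)


# Hypothetical inputs, for illustration only.
Y = 1.0            # work in units of the yield, so results scale with y
M_STAR = 1.0e11    # characteristic mass of the assumed mass function
M_CAP = 1.0e12     # assumed mass at which Z_s reaches y


def phi(m):
    """Schechter-like mass function (assumed form, arbitrary normalization)."""
    return (m / M_STAR) ** -1.25 * math.exp(-m / M_STAR)


def z_of_m(m):
    """Z_s proportional to M_s**0.25, capped at the yield."""
    return min(Y, Y * (m / M_CAP) ** 0.25)


lost = metals_lost(Y, z_of_m, phi, 1.0e8, 1.0e13)
stars = stellar_mass(phi, 1.0e8, 1.0e13)
print(f"metals lost per unit stellar mass: {lost / stars:.2f} y")
```

The ratio printed is the mass-weighted mean of $(y - Z_s)$ over the assumed population; it must lie between 0 (all galaxies at $Z_s = y$) and $y$ (no metals retained in stars).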

This is a very significant quantity of metals. For example,
about 1/3 of the luminosity of the Coma cluster is due to its
elliptical galaxies, so they would provide a mass $\sim 1\, M_\odot Z_\odot$ of
metals per solar luminosity of the cluster, corresponding to
$\sim 0.1\, Z_\odot$ of metals per unit mass of galaxies (the ordinary stellar
mass). If the bulges of spiral and S0 galaxies also lost their
metals to the IGM, this estimate could be doubled, but
some of those metals may be in the disks (cf. Section 4.3.1).
Iron can be considered as a representative metal in this calculation,
so we predict $\sim 1 \times (\mathrm{Fe/H})_\odot \times M_\odot$ of iron in the IGM
per solar luminosity of the cluster. The actual mass of iron is
quite uncertain, and could be equal to the predicted amount.
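
The Coma estimate combines three numbers given in the text: the (2 to 5) $M_\odot Z_\odot$ of metals per solar luminosity of ellipticals, the fraction 1/3 of cluster light contributed by ellipticals, and a stellar mass-to-light ratio of about 10 in solar units. The Python sketch below chains them together; the function names are illustrative assumptions.

```python
def cluster_metals_per_lum(elliptical_light_fraction: float,
                           metals_per_elliptical_lum: float) -> float:
    """Intergalactic metal mass (in M_sun Z_sun) per solar luminosity of the cluster."""
    return elliptical_light_fraction * metals_per_elliptical_lum


def metals_per_stellar_mass(metals_per_lum: float,
                            stellar_mass_to_light: float = 10.0) -> float:
    """Metals per unit stellar mass of galaxies, in units of Z_sun."""
    return metals_per_lum / stellar_mass_to_light


# Range quoted in the text: 2 to 5 M_sun Z_sun per solar luminosity of ellipticals.
for per_lum in (2.0, 5.0):
    cluster = cluster_metals_per_lum(1.0 / 3.0, per_lum)
    print(f"{per_lum:.0f} per elliptical L_sun -> {cluster:.2f} per cluster L_sun, "
          f"{metals_per_stellar_mass(cluster):.2f} Z_sun per unit stellar mass")
```

The two ends of the quoted range bracket the round values of about 1 $M_\odot Z_\odot$ per solar luminosity and about 0.1 $Z_\odot$ per unit stellar mass used in the text.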


5.4.2 Overall gas loss from galaxies 


The total mass of gas lost from elliptical galaxies is a less pre- 
dictable quantity, depending on gas flows within the galaxies 
and gain or loss of gas during the time of star formation. 
Nevertheless, some estimates are interesting. 


In order of magnitude, almost any model will predict a
mean metallicity of the lost gas that exceeds the mean metallicity
of stars, since the gas has the composition of the last
and most metal-rich stars formed; i.e., $Z_i \gtrsim Z_s$. The mass of
gas lost is therefore $M_{gi} = M_{Zi}/Z_i \lesssim M_{Zi}/Z_s$. With $Z_s \lesssim y \sim Z_\odot$


5.5 Abundance Gradients in Disks

for the mean of a typical cluster of galaxies, we therefore expect
that $M_{gi} \sim M_{Zi}/Z_\odot$, very roughly. This implies that the
elliptical galaxies in the Coma cluster have ejected $\sim 1\, M_\odot$ of
gas per solar luminosity of the cluster, which is at the lower
end of the range of estimates of the cluster gas content, from
X-ray data (Section 5.1). Various specific calculations using
the models discussed in Sections 5.3.1 and 5.3.2 lead to similar
results within a factor $\sim 3$. It is therefore uncertain whether
gas loss from ellipticals could be the entire source of gas in
a cluster like Coma, or whether some of the IGM is simply
primordial material that was never in a galaxy. (In the latter
case, the intergalactic metals could nevertheless have been
supplied by galaxies.)

An interesting comment on the origin of the cluster IGM 
has been made by Ostriker (1977b): the distribution of mor- 
phological types of galaxies in clusters like Coma differs from 
the field distribution in having a much smaller fraction of spirals,
many more S0s, and somewhat more ellipticals (Oemler
1974). If one “corrects” the cluster galaxies by adding disk 
matter until the overall ratio (disks)/(elliptical galaxies + 
bulges) equals that of the field, the mass of extra disk matter 
required is ~50% of the (ordinary) mass of galaxies in the 
cluster. This in turn is a significant fraction of the mass of 
cluster IGM. Ostriker (1977b) therefore proposes that some 
of the IGM is material that would have been made into disks 
in a less dense environment, but instead was swept up into a 
hot ambient IGM. This idea ties in with several scenarios for 
the formation of disk galaxies, in which the disk is made from 
diffuse gas that is accreted after the formation of a spheroidal 
component. For example, in the model of Ostriker & Thuan 
(1975), a significant fraction of the disk is shed by halo stars; 
and in the picture of Tinsley & Larson (1979), the disk forms 
from diffuse gas after denser pieces have merged to make the 
bulge. 


5.4.3 Ejection from evolving stars in elliptical galaxies 


The stellar population in elliptical galaxies (and S0 galaxies
and the bulge-halo components of spirals) is predominantly
very old, and the light of these galaxies is dominated by red
giants. It is almost certain that such stars lose a few tenths
of a solar mass, between the MS turnoff and the end of their
lives, to die as white dwarfs of typically 0.7 $M_\odot$ (Section 2.4.1).
This mass has been included in the total mass loss considered 
above, but it is interesting to calculate also the present rate 
of mass loss by stars in elliptical galaxies. 


For an analytical estimate, let us assume that all the stars
in the system formed at the same time, t = 0. Let $M_0$ be the
mass of stars formed, and let $\phi(m)$ be the IMF; the number of
stars formed in the mass interval (m, m + dm) is therefore

$$ n(m)\,dm = M_0\,\phi(m)\,dm, \tag{5.14} $$

by Equation (2.1). Now imagine these stars peeling off the
MS and dying soon afterward as they reach their lifetimes
$\tau_m$. The number of stars dying per unit time is clearly

$$ D(t) = n(m_t)\left|\frac{dm}{d\tau_m}\right|_{\tau_m = t}, \tag{5.15} $$

where $m_t$ is the turnoff mass ($\tau_m = t$). The stellar
mass-lifetime relation can be approximated by a power law,

$$ \frac{m}{m_1} = \left(\frac{\tau_m}{\tau_1}\right)^{-\theta}, \tag{5.16} $$

where $\tau_1$ is the lifetime of a fiducial mass $m_1$ and $\theta \simeq 0.25$ in
the mass range of interest ($m \sim 1\,M_\odot$). It is convenient to use
a power-law IMF, Equation (2.3), normalized to $\phi(m_1) = \phi_1$;
masses in only the small range ~0.5-1 $M_\odot$ are relevant to
the following calculation, so this IMF may be a reasonable
approximation even if a single power law would not apply to
all masses. The ejection rate can be obtained by multiplying
D(t) by $(m_t - w_m)$, the mass lost per star with remnant mass
$w_m$, with the result


$$ E(t) = M_0\,\phi_1\,\theta\,\frac{m_1}{\tau_1}\,(m_t - w_m)\left(\frac{t}{\tau_1}\right)^{-1+\theta x}. \tag{5.17} $$

Since $(m_t - w_m)$ changes slowly with time and $\theta x$ is probably
only a few tenths, this expression shows that the ejection rate
E(t) varies approximately as $t^{-1}$. In Section 6.2.4, an analytical
expression is derived for the luminosity of stars in this
model, and it is shown that Equation (5.17) leads to a ratio of
ejection rate to integrated blue luminosity of the population,

$$ \frac{E}{L_B} \sim 0.02\ M_\odot\,L_{B\odot}^{-1}\,\mathrm{Gyr}^{-1} \tag{5.18} $$

at a present time ~$10^{10}$ yr.
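
As a rough numerical illustration of Equations (5.16) and (5.17), the following Python sketch evaluates the turnoff mass and the ejection rate of a single burst. Only $\theta = 0.25$ and the remnant mass of 0.7 $M_\odot$ come from the text; the fiducial values $m_1 = 1\,M_\odot$ and $\tau_1 = 10$ Gyr, and the normalization `m0_phi1` (standing for the product $M_0\phi_1$), are assumptions chosen only for illustration.

```python
def turnoff_mass(t, m1=1.0, tau1=10.0, theta=0.25):
    """Turnoff mass from the power law of Eq. (5.16): m/m1 = (t/tau1)**(-theta)."""
    return m1 * (t / tau1) ** (-theta)


def ejection_rate(t, x, w_m=0.7, m0_phi1=1.0, m1=1.0, tau1=10.0, theta=0.25):
    """Ejection rate of Eq. (5.17), in units set by m0_phi1 (assumed normalization).

    t and tau1 are in Gyr, masses in solar masses; x is the IMF slope.
    """
    m_t = turnoff_mass(t, m1, tau1, theta)
    return (m0_phi1 * theta * (m1 / tau1) * (m_t - w_m)
            * (t / tau1) ** (-1.0 + theta * x))


if __name__ == "__main__":
    for t in (5.0, 10.0, 20.0):
        print(t, round(turnoff_mass(t), 3), ejection_rate(t, x=1.0))
```

With these assumed values the rate falls somewhat faster than $t^{-1+\theta x}$ alone would suggest, because the factor $(m_t - w_m)$ also shrinks slowly as the turnoff mass declines.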


Most elliptical galaxies have a mass of neutral hydrogen 
that is less than 0.1 times their luminosity, in solar units, 
and many better studied ellipticals have less neutral hydrogen 
than 0.01 times their luminosity (Knapp et al. 1978). Faber 
& Gallagher (1976) argue that significant amounts of ISM 
cannot be hiding in elliptical galaxies in ionized or molecular 
form. Thus the ejection rate given by Equation (5.18) would 
provide more than the observed amount of gas in a few Gyr, 
or even in less than 1Gyr. Possible fates for this gas have been 
thoroughly discussed by Faber & Gallagher (1976); they note 
that star formation at the rate in Equation (5.18) would be 
detectable (unless only low-mass stars form), so they conclude 
that the gas ejected from stars is being continually lost from 
the galaxies. On the other hand, Oemler & Tinsley (1979) 
argue that star formation at the rate required to use up this 
gas could have escaped detection in most ellipticals, and could 
account for their supernova rate. 
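
The comparison between the ejection rate and the observed gas content is simple arithmetic, sketched below in Python. It assumes that the rate of Equation (5.18) stays constant and that no ejected gas is consumed or lost, which is a simplification made only for this estimate; the function name is hypothetical.

```python
def accumulation_time_gyr(gas_to_light, ejection_per_light=0.02):
    """Time (Gyr) for stellar ejecta to build up a given gas mass per unit
    blue luminosity (solar units), assuming the rate of Eq. (5.18) is
    constant and no gas is consumed or lost."""
    return gas_to_light / ejection_per_light


if __name__ == "__main__":
    for limit in (0.1, 0.01):  # upper limits on M(HI)/L quoted in the text
        print(limit, accumulation_time_gyr(limit), "Gyr")
```

The two limits quoted above correspond to roughly 5 Gyr and 0.5 Gyr, consistent with the statement that the observed gas would be supplied in a few Gyr or less.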


5.5 Abundance Gradients in Disks 


Abundance gradients in disk stars and gas cannot be fully 
accounted for by gradients in the gas fraction (Section 5.1), so 
it is of interest to see whether dynamical processes analogous 
to those discussed in Section 5.2, for spheroidal systems, could 
be responsible. A gradient in the IMF could again be invoked, 
but this mechanism will not be discussed further. 


5.5.1 Effects of infall 


The idea that disks of galaxies form by accretion, incorpo- 
rating metals from the young halo, has been suggested by 
dynamical models and supported by the metallicities of stars 
in the Solar neighborhood (Section 4.3). The properties of 
the accretion process that affect chemical evolution are the 
metallicity of infalling gas ($Z_f$) and the ratio of SFR to infall
rate ($\psi/f$). If there is a radial gradient in these quantities,
there must be a corresponding metallicity gradient in
the disk. In particular, the metallicity of the gas at any time
tends to a value given by Equation (4.6), $Z \to ky + Z_f$, where
$k = (1-R)\psi/f$. Thus the disk gas has about this metallicity
at the present time, in regions where infall is at all effective,
i.e., where k is not so large that Z takes too long to approach
its asymptotic value; the stars in turn reflect the metallicity
of the gas at the time when they formed.

Figure 12. (a) Star formation rates at several radii in the equatorial plane of a collapse model for the formation of a disk galaxy (Larson
1976c). The radius in pc is marked on each curve, and the three ticks indicate the times at which star formation is 10%, 50%, and 90%
complete (relative to the final mass of stars) at that radius. (b) Metal abundances in the gas (relative to the yield) in the equatorial plane
of the same model (Tinsley & Larson 1978). In this Figure, the radii are given in kpc, ticks have the same meaning as before, and open
circles denote the time of maximum gas density at each radius.
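
The approach of the gas metallicity to its asymptotic value can be illustrated numerically. The Python sketch below assumes the instantaneous-recycling equations in the form dM_g/dt = -(1-R)ψ + f and M_g dZ/dt = y(1-R)ψ + (Z_f - Z)f, with constant ψ and f. This form, the simple Euler integration, and all numerical values are assumptions made for illustration rather than a reproduction of any published model.

```python
def evolve_metallicity(k, y=0.02, z_f=0.0, f=1.0, m_g0=1.0, z0=0.0,
                       t_end=50.0, dt=0.001):
    """Euler integration of gas mass and metallicity with constant infall.

    k = (1 - R) * psi / f is the ratio of net star formation to infall.
    Units are arbitrary; only ratios matter here.
    """
    net_sfr = k * f              # (1 - R) * psi
    m_g, z, t = m_g0, z0, 0.0
    while t < t_end and m_g > 0.0:
        dz = (y * net_sfr + (z_f - z) * f) / m_g
        dm = f - net_sfr
        z += dz * dt
        m_g += dm * dt
        t += dt
    return z


if __name__ == "__main__":
    for k in (0.5, 1.0):
        print(k, evolve_metallicity(k), "expected limit:", k * 0.02)
```

In this sketch Z relaxes toward ky + Z_f on a timescale of order M_g/f, so regions with little infall relative to their gas mass approach the limit only slowly.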

In the dynamical models studied by Tinsley & Larson 
(1978), the value of $\psi/f$ decreases outward in the disk at
all times, because star formation is less efficient at low densities;
$Z_f$ is negligible at late times at all radii, but it has a
significant negative gradient at early stages when metals from 
the young halo (central bulge) were most important near the 
center. Figure 12 (a) illustrates the SFR versus time at sev- 
eral radii in the equatorial plane of one of these models, and 
Figure 12 (b) gives the corresponding metallicities in the gas. 
At small radii, most stars form early from metal-rich infalling 
gas, while the outer regions experience star formation on a 
much longer timescale and are still relatively metal-poor. The 
gas at the present time thus has a negative metallicity gradient
due mainly to the gradient in $\psi/f$, while the gradient in
stellar abundances is due partly to the early gradient in $Z_f$.
The sizes of the model gradients are comparable to observed 
values, so this very schematic model may possibly be showing 
some effects that occur in real disk galaxies. 

Generalizing these results, we can conclude as in Section 4.3
that the chemical properties of disks could plausibly
be strongly affected by gas flows that constitute the formation
of the disk itself.


5.5.2 Effects of radial gas flows 


Radial inflow of gas in disks possibly occurs as a result of 
transfer of angular momentum by viscosity, loss of angular 
momentum from the gas to spiral or bar-like density waves, 
and other mechanisms (e.g. Kalnajs 1978; Ostriker 1977a). 
These processes are rather speculative, since the inflow in 
many models could be a numerical artifact, but it is interest- 
ing to see how chemical evolution could be affected by flow 
velocities of a plausible magnitude. 

The metals are concentrated inward by this process, as by
gaseous dissipation in a collapsing spheroidal system (Section 5.2.1).
Let us consider an annulus of a galaxy between
radii $r$ and $r + \delta r$, measured in the disk. The chemical evolution
of this annulus can be studied in the instantaneous recycling
approximation, using Equations (3.17) and (3.19). Let $M_g$
and $\psi$ in those equations be replaced by $2\pi r M_g\,\delta r$ and $2\pi r \psi\,\delta r$,
respectively, where $M_g$ and $\psi$ now denote the corresponding
surface densities; let $f$ be replaced by the net rate of inflow
into the annulus, i.e., $F(r) - F(r+\delta r) = -(\partial F/\partial r)\,\delta r$, where F is a
flow rate (in $M_\odot\,\mathrm{yr}^{-1}$) with a positive sign for outward motion;
and let $Z_f f$ be replaced by the net rate of inflow of metals,
which is $Z(r)F(r) - Z(r+\delta r)F(r+\delta r) = -Z(\partial F/\partial r)\,\delta r - (\partial Z/\partial r)F\,\delta r$.
The equations then reduce to

$$ \frac{\partial M_g}{\partial t} = -(1-R)\psi - \frac{1}{2\pi r}\frac{\partial F}{\partial r} \tag{5.19} $$

and

$$ M_g\frac{\partial Z}{\partial t} = y(1-R)\psi - \frac{1}{2\pi r}F\frac{\partial Z}{\partial r}. \tag{5.20} $$

Equation (5.20) shows that the radial flow is consistent with
a steady-state abundance gradient,

$$ \frac{\partial (Z/y)}{\partial r} = 2\pi r\,\frac{(1-R)\psi}{F}, \tag{5.21} $$

which is negative if the flow is inward. The flow causes Z to
change on a timescale

$$ \tau_F \sim \frac{2\pi r^2 M_g}{|F|}. \tag{5.22} $$

F can be expressed in terms of the flow velocity v, where |F| =
(mass of gas in the annulus)/(time for gas to flow across $\delta r$)
= $2\pi r M_g |v|$. The timescale for radial flow to be effective is thus

$$ \tau_F \sim \frac{r}{|v|}, \tag{5.23} $$

and the corresponding gradient, for inward flow, can be written

$$ \frac{\partial (Z/y)}{\partial (\ln r)} \sim -\frac{\tau_F}{\tau_*}, \tag{5.24} $$

where $\tau_* = M_g/(1-R)\psi$ is the timescale for star formation to
use up the gas. These relations show that rapid inflow, with a
timescale less than that for star formation, quickly obliterates
any radial metallicity gradient, while slow inflow can lead to
a significant one.

Substituting values of $\psi$, $M_g$, and $r$ for the Solar neighborhood,
it is found that the interstellar abundance gradient
(Section 5.1) is consistent with inflow at a few km s$^{-1}$, carrying
a flux ~1 $M_\odot$ yr$^{-1}$; the timescale for the gradient to change
is a few Gyr. There is no strong evidence for the occurrence 
of systematic gas flows of this magnitude in the Galaxy, but 
nor can they be ruled out. 
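
The orders of magnitude quoted above can be checked with a short Python sketch of Equations (5.23) and (5.24) and of the relation $|F| = 2\pi r M_g |v|$. The input values used here (a Galactocentric radius of 10 kpc, a gas surface density of 8 $M_\odot$ pc$^{-2}$, an inflow speed of 2 km s$^{-1}$, and a star formation timescale of 5 Gyr) are illustrative assumptions, not values given in the text.

```python
import math

PC_PER_MYR_PER_KMS = 1.0227  # 1 km/s expressed in pc/Myr (approximate)


def flow_timescale_gyr(r_kpc, v_kms):
    """Eq. (5.23): tau_F ~ r / |v|, returned in Gyr."""
    v_pc_per_myr = abs(v_kms) * PC_PER_MYR_PER_KMS
    return (r_kpc * 1.0e3 / v_pc_per_myr) / 1.0e3


def flow_rate_msun_per_yr(r_kpc, sigma_gas, v_kms):
    """|F| = 2 pi r M_g |v|, with M_g a gas surface density in Msun/pc^2."""
    v_pc_per_yr = abs(v_kms) * PC_PER_MYR_PER_KMS * 1.0e-6
    return 2.0 * math.pi * (r_kpc * 1.0e3) * sigma_gas * v_pc_per_yr


def log_gradient(tau_flow, tau_star):
    """Eq. (5.24) for inward flow: d(Z/y)/d(ln r) ~ -tau_F / tau_*."""
    return -tau_flow / tau_star


if __name__ == "__main__":
    tau_f = flow_timescale_gyr(10.0, 2.0)
    print("tau_F [Gyr]:", tau_f)
    print("|F| [Msun/yr]:", flow_rate_msun_per_yr(10.0, 8.0, 2.0))
    print("d(Z/y)/dln r:", log_gradient(tau_f, 5.0))
```

With these assumed inputs the flow timescale is a few Gyr and the flux is close to 1 $M_\odot$ yr$^{-1}$, in line with the estimates in the text.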

Sanders (1977) has suggested that the deep minimum in the 
surface density of gas in the Galaxy, in an annulus between 
0.6 and 4kpc, could be due to inflow into the central 600 pc, 
where the total quantity of ISM is enough to fill the depleted 
annulus. If so, inflow could perhaps be fueling the strong star 
formation at the Galactic center.

Evolution of stars in galaxies affects not only their chemical
compositions, but also their integrated luminosities, colors,
and spectra. Photometric and chemical evolution can be
studied separately, because they depend largely on comple- 
mentary properties of a galaxy: the colors at a given time 
are governed strongly by the current rate of star formation 
relative to its past average value, whereas the chemical com- 
position depends mainly on the integrated past star forma- 
tion relative to the gas content and on the ratio of the SFR 
to gas flow rates. In studying photometric evolution, we can 
ignore the effects of ISM, except in correcting colors for any 
reddening or gaseous emission, and we can avoid assumptions 
relating the SFR to the gas supply. Of course, a complete
understanding of the properties of a galaxy would include the
relations among its history of star formation, gas content and
gas flows, and chemical composition (cf. Figure 1), but more 
can be learned by tackling pieces of the puzzle separately 
first. 


6.1 Aims and Methods 


Models for the stellar population of a galaxy address three re- 
lated questions: what types of stars are present, what history 
of star formation produced them, and what was the popula- 
tion and its photometric properties in the past? The answers 
to these questions have many applications, such as interpret- 
ing in terms of star formation the correlations between pho- 
tometric and morphological properties of galaxies, and pre- 
dicting changes on cosmological timescales. 

Methods of constructing population models can be divided 
into three categories: “population synthesis” with no explicit 
evolution, evolutionary models, and analytical approxima- 
tions. 


6.1.1 Population synthesis 


This approach is to find the “best” mixture of stars to match 
the colors of the galaxy under study. The inferred distribu- 
tion of stars in the HR diagram then contains information on 
their past formation rate and the IMF, and it is often possible 
to judge the mean chemical composition and even to detect 
minor components of high or low metallicity. The procedure 
is to observe the colors of the galaxy and a variety of nearby 
stars, generally including narrow-band photoelectric colors 
and indices giving the strengths of spectral features that are 
sensitive to stellar temperature, luminosity, or composition. 
Then synthetic colors are computed for various mixtures of 
stars and compared with the galaxy colors. The search for 
an optimal mixture can be made in many ways, ranging from 
trial-and-error to elaborate computer algorithms; the method 
generally used, quadratic programming, was introduced to 
the field by Faber (1972). Because of observational errors, and 
because the available nearby stars do not include all types in 
the galaxy under study, a perfect fit is seldom found, and 
the solution that (formally) minimizes the errors is not nec- 
essarily the most plausible. The choice of a “best” synthetic 
population must therefore be based on imposed astrophysi- 
cal constraints; these include such requirements as a smooth 
MS luminosity function, and a distribution of subgiants and 
giants that could plausibly arise from evolution off the MS. 
The lack of an objectively defined best fit means that the fi- 
nal solution depends strongly on the imposed constraints, as 
emphasized by Williams (1976). 
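
The fitting step can be sketched in a few lines of Python. The example below is only a stand-in for the quadratic programming used in practice: it finds non-negative weights for a handful of stellar types by projected gradient descent on the squared residuals, and it imposes none of the astrophysical constraints discussed above. The three stellar "types", their band fluxes, and the galaxy fluxes are made-up numbers, and `fit_mixture` is a hypothetical helper.

```python
def fit_mixture(star_colors, galaxy, steps=20000, rate=0.01):
    """Non-negative least-squares mixture weights by projected gradient descent.

    star_colors[j][i] is the flux of stellar type j in band i (normalized);
    galaxy[i] is the observed flux in band i. Returns one weight per type.
    """
    n_types, n_bands = len(star_colors), len(galaxy)
    weights = [1.0 / n_types] * n_types
    for _ in range(steps):
        model = [sum(weights[j] * star_colors[j][i] for j in range(n_types))
                 for i in range(n_bands)]
        resid = [model[i] - galaxy[i] for i in range(n_bands)]
        for j in range(n_types):
            grad = 2.0 * sum(resid[i] * star_colors[j][i]
                             for i in range(n_bands))
            weights[j] = max(0.0, weights[j] - rate * grad)  # keep weights >= 0
    return weights


if __name__ == "__main__":
    # Made-up fluxes in three bands for a hot star, a dwarf, and a giant.
    stars = [[1.0, 0.5, 0.2], [0.5, 1.0, 0.7], [0.1, 0.6, 1.0]]
    galaxy = [0.22, 0.72, 0.91]  # built as 0.3 * dwarf + 0.7 * giant
    print(fit_mixture(stars, galaxy))
```

In this toy case the fit is exact and unique. With real data, where errors are present and the stellar library is incomplete, many mixtures fit almost equally well, which is why the imposed constraints matter so much.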

There are often several astrophysically acceptable synthetic 
populations that match the galaxy colors equally well but cor- 
respond to significantly different histories of star formation. 
An example of such ambiguity appears in models for ellipti- 
cal galaxies, reviewed by Faber (1977). All studies agree that 
most of the light of these galaxies, from blue through infrared 
wavelengths, comes from an old stellar population with a dis- 
tribution in the HR diagram like an old open cluster plus an 
extended giant branch (cf. Figure 5). However, such mod- 
els almost always fail by a few percent to account for the 
light around 3500 Å (e.g. U - B is predicted to be too red by
~0.1 mag), so they are lacking some hot stellar component that
is present in the real galaxies. To date it has been impossible
to determine whether the hot stars are a few upper-main- 
sequence stars, implying ongoing star formation, or a minor 
population of old objects such as horizontal-branch stars or 
blue stragglers (which can be seen in the color-magnitude di- 
agram for the old cluster M67, Figure 5). Obviously, it would 
be very interesting to know if typical elliptical galaxies really 
are still making stars at a slow residual rate! For the central 
bulge of M31, which is optically indistinguishable from an elliptical
galaxy, there are broadband colors down to 1550 Å,
but even these data have not resolved the ambiguity (Wu 
et al. 1980). A new avenue has been opened by a demonstra- 
tion that the integrated light of stars in a nuclear bulge re- 
gion of our own galaxy matches exactly the integrated light of 
comparable regions of spirals and ellipticals (Whitford 1978). 
The brighter stars are individually observable in the Galactic 
bulge, so a star-by-star synthesis of their contribution to the 
light is possible. Perhaps in the end this approach will tell 
whether an old population alone can account for the ultravi- 
olet light. 


Another general problem is that even the best defined re- 
gions in the HR diagram cannot be interpreted uniquely in 
terms of a past history of star formation. The models are in- 
sensitive to many details of the IMF and SFR, for two basic 
reasons: 


(i) the integrated light of galaxies is dominated by regions 
of the HR diagram that depend theoretically on rather few 
parameters of star formation; and 


(ii) some types of stars, such as red giants, may have 
evolved from a wide range of MS masses (Section 2.4.2), so 
they cannot be traced uniquely back to an initial mass and 
time of star formation. 


The second of these problems is avoided in the evolution- 
ary method described next, but the first remains and will be 
discussed below. 


6.1.2 Evolutionary models 


This approach relies primarily on stellar evolution theory to 
suggest allowable populations, as follows. Theoretical tracks 
(or isochrones) of stars in the HR diagram are used to com- 
pute the stellar population that would arise, at a given age, 
from a given SFR and IMF, with a given chemical composi- 
tion; the integrated colors are then calculated, using observed 
colors of stars in appropriate parts of the HR diagram, and 
the results are compared with the colors of the galaxy under 
study. The aim is to derive from a series of models the SFR, 
IMF, age, and composition(s) that best match the galaxy, 
and thereby to learn not only about its present stellar pop- 
ulation but also about its past history and past photometric 
properties. 

In practice, stellar evolution is not well enough understood 
for fully theoretical models to be reliable. The main problems 
are related to late stages of evolution, including particularly 
the giant branches in old stellar populations, whose effects on 
models for elliptical galaxies are reviewed by Faber (1977). 
These problems are alleviated by using statistical studies of 
nearby giants to provide semi-empirical evolutionary tracks 
(Section 2.4.1), and by allowing the most uncertain types of 
stars to be present in numbers that are treated as adjustable
parameters. This method thus closely resembles some
non-evolutionary population syntheses in which the constraints
are chosen to represent stellar evolutionary tracks (O’Connell 
1976). 

The evolutionary approach has several advantages. The 
best established aspects of stellar evolution theory are in- 
corporated, so the resulting population is a possible one as 
far as can be determined. Uncertainties cannot be formally 
calculated, but from trials with a variety of assumptions one 
can estimate subjectively the allowable range of parameters. 
Often this range is small enough to lead to useful conclusions 
about the past history of star formation, and to predictions 
of photometric changes of cosmological interest. For example, 
it is possible to determine the slope of the IMF in elliptical 
galaxies closely enough to be sure that their integrated lumi- 
nosity declines with increasing age (Section 6.2). 

Uncertainties in the conclusions from this method arise 
partly from uncertainties in stellar evolution, and partly from 
the intrinsic insensitivity of integrated colors to many param- 
eters of interest — a problem found earlier with population 
syntheses. Two parts of the HR diagram tend to dominate 
the integrated light, as illustrated spectroscopically by the 
work of, e.g., Morgan & Mayall (1957) and Morgan & Oster- 
brock (1969): B stars on the upper main sequence, and late 
G through early M giants. These dominant regions are ex- 
tended out to O stars in ultraviolet light and to late M giants 
in the infrared. If young stars are absent, low-mass giants 
dominate at visual and longer wavelengths, so the colors de- 
pend much more on stellar evolution than on the IMF or past 
SFR; at shorter wavelengths, however, turnoff stars are seen 
so the colors give some information on the age of the system. 
If young stars are present, the light at short wavelengths is 
dominated by OB stars, whose relative numbers depend on 
the IMF and whose total numbers (relative to red stars) de- 
pend on the ratio of the present SFR to its integrated past 
value. Stars with lifetimes from a few times $10^8$ yr to just below
the age of the system (usually A and F stars) contribute
relatively little light, so there is little information on either 
their part of the IMF or the detailed time-dependence of the 
SFR. In Section 7.1, models will be discussed that illustrate 
the dominance of the upper main sequence and/or low-mass 
giants, depending on the SFR. 


Programs for constructing evolutionary models have been 
described by Tinsley (1968, 1972a, 1978b), Searle et al. 
(1973), Tinsley & Gunn (1976a), and Larson & Tinsley 
(1978). The mechanical details are far less troublesome than 
the input “data” representing stellar tracks, and it is easy 
to obtain numerical accuracy far exceeding the astrophysi- 
cal certainty of the calculations. There are two types of this 
technique. 


(i) The first method is to supply the computer with evo- 
lutionary tracks in the HR diagram for stars with a series of 
discrete masses, or with isochrones for a series of discrete ages; 
separate stellar data are used for each chemical composition 
of interest. Then, for a given IMF and SFR, the calculation 
yields the numbers of stars on a large grid of points in the 
HR diagram, as a function of the age of the system. 


(ii) The second method uses the first type of program once 
only for each IMF and composition, to give the integrated 
colors at a series of ages of a model whose SFR consists of a 
single initial burst. These are then regarded as the colors of
“generations” of stars with a given age (and IMF and composition).


A model with any prescribed SFR can then be treated, 
at each age, as the sum of such generations in proportions 
given by the SFR. The number of generations whose proper- 
ties must be combined to obtain the integrated colors of any 
model is much smaller than the number of points in the HR 
diagram that are referred to directly in the first method, so 
the second approach is more economical. In either method, 
it is clearly possible to add arbitrary numbers of stars of un- 
determined evolutionary status, in the spirit of population 
synthesis. 
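
The second method amounts to a sum over generations, which the following Python sketch illustrates. The functions `burst_lum` and `sfr` are placeholders for tabulated single-burst results and an adopted star formation history; the midpoint sum and the step size are assumptions made for simplicity.

```python
def integrated_luminosity(burst_lum, sfr, age, dt=0.1):
    """Luminosity at a given age as a sum over generations.

    burst_lum(a): luminosity per unit mass of a single burst of age a.
    sfr(t): star formation rate at time t after the galaxy formed.
    Uses a midpoint sum over generations formed at times t in (0, age).
    """
    n_steps = int(round(age / dt))
    total = 0.0
    for i in range(n_steps):
        t_form = (i + 0.5) * dt          # formation time of this generation
        total += sfr(t_form) * burst_lum(age - t_form) * dt
    return total


if __name__ == "__main__":
    # Toy inputs: a burst that fades as 1/(1 + age) and a constant SFR.
    print(integrated_luminosity(lambda a: 1.0 / (1.0 + a), lambda t: 1.0, 10.0))
```

Applying the same sum to the single-burst luminosities in two bands, and taking the ratio of the results, gives an integrated color for the adopted star formation history.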

While models with Solar metallicity can rely on nearby 
stars to provide colors and semi-empirical evolutionary 
tracks, there is no such convenient sample for other com- 
positions. In making models for non-Solar metallicities, it is 
often most convenient to change the “standard” models dif- 
ferentially, rather than starting from scratch with tracks and 
colors for each set of stars. Faber (1973) first used the metal- 
licity effects discussed in Section 2.4.5 to estimate differential 
changes in the integrated colors of elliptical galaxies, as a 
function of metallicity, and her methods have been adapted 
by others subsequently. Recent results for elliptical galaxies 
have been cited in Section 5.1. The calculations of metallicity 
effects in integrated light are still much less secure than one 
would like, and there is a need for more basic work on stellar 
evolution and atmospheres at non-Solar compositions, includ- 
ing non-Solar abundance ratios among elements heavier than 
helium (Faber 1977). 


6.1.3 Analytical approximations


Some of the results from evolutionary models can be under- 
stood qualitatively using analytical approximations. These 
have proved particularly tractable for models in which all the 
stars form in a single initial burst, which is a first approxi- 
mation to the population in elliptical galaxies. Such models 
will be considered next. 


6.2 Evolution of a Single Generation of Stars 


Many numerical models designed to match detailed photom- 
etry of elliptical galaxies have shown that nearly all the light 
at visual and longer wavelengths can be accounted for by a 
very old population, with a turnoff near the Sun’s position 
on the MS. The metallicities of the dominant stars appear to 
be within a factor of two of Solar in wide-aperture observa- 
tions of giant ellipticals, although their centers may be more 
metal-rich and small ellipticals are metal-poor (Section 5.1). 
Reviews by van den Bergh (1975) and Faber (1977) cover 
the history and recent status of this subject, and a few sub- 
sequent developments have been referred to in Section 6.1. 
The implications of a predominantly very old population for 
the evolution of elliptical galaxies are best understood using 
analytical approximations. 


6.2.1 Content and luminosity 


Let us consider a single generation of stars, formed with total 
mass $M_0$ in a short burst (as in Section 5.4.3), with a fixed
chemical composition near Solar. The population evolves by
peeling off the MS, as can be visualized from Figures 2 and 3. 

The IMF will be taken to be a power law, normalized to
$\phi(m_1) = \phi_1$, where $m_1$ is the turnoff mass at a fiducial time
$\tau_1$. The power-law approximation need only hold over a small
mass interval, since the light at present comes almost entirely
from stars between 0.4 $M_\odot$ and turnoff, and the turnoff mass at
ages of interest, ~5-20 Gyr, lies in the small range ~0.9-1.2 $M_\odot$.

At a time $t$ after star formation, the MS stars present
have masses from the lower limit at formation, $m_L$, up to
the turnoff mass $m_t$, which is given by substituting $\tau_m = t$ in
Equation (5.16). Thus the number of dwarfs with masses in
the interval (m, m + dm) is, by Equation (2.3),

$$ n_d(m)\,dm = M_0\,\phi(m)\,dm = M_0\,\phi_1\left(\frac{m}{m_1}\right)^{-(1+x)} dm, \qquad m_L \leq m \leq m_t. \tag{6.1} $$

Stars slightly more massive than $m_t$ are present as giants, and
their total number is the number of stars that were on the MS
with lifetimes between $t$ and $t - \tau_g$, where $\tau_g$ is the duration of
post-MS evolution for masses ~$m_t$. (The term “giants” is used
loosely here to mean all post-MS stars; the analysis can easily
be modified to refer to any portion of post-MS evolution). The
number of giants is therefore

$$ n_g(t) = n(m_t)\left|\frac{dm}{d\tau_m}\right|_{\tau_m = t}\tau_g = M_0\,\phi(m_t)\left|\frac{dm}{d\tau_m}\right|_{\tau_m = t}\tau_g. \tag{6.2} $$

The luminosity of individual dwarfs in the mass range of
interest can be approximated by a power law,

$$ \ell_d(m) = \ell_1\left(\frac{m}{m_1}\right)^{\alpha}, \tag{6.3} $$

where $\alpha \simeq 5$. For giants, an average luminosity $\ell_g$ is defined
so that the product $\ell_g\tau_g$ gives correctly the integrated light
output during post-MS evolution. The values of $\ell_1$, $\alpha$, and $\ell_g$
of course depend on the wavelength interval of interest, and
so do the results below relating to luminosities. (For bolometric
light, the product $\ell_g\tau_g$ is proportional to the amount
of nuclear energy used, but it has no such interpretation in
restricted wavelength bands).

The integrated luminosities and masses of dwarfs and giants
can now be derived from Equations (6.1)-(6.3) and
Equation (5.16). It will be assumed in the integrals that
$m_L \ll m_t$. The total mass of dwarfs at time t depends critically
on whether the slope of the IMF (x) is less than or greater
than 1:

$$ M_d(t) = \int_{m_L}^{m_t} m\,n_d(m)\,dm =
\begin{cases}
\dfrac{M_0\phi_1 m_1^2}{x-1}\left(\dfrac{m_L}{m_1}\right)^{-x+1}, & x > 1, \qquad \text{(6.4a)} \\[2ex]
M_0\phi_1 m_1^2\,\ln\left(\dfrac{m_t}{m_L}\right), & x = 1, \qquad \text{(6.4b)} \\[2ex]
\dfrac{M_0\phi_1 m_1^2}{1-x}\left(\dfrac{t}{\tau_1}\right)^{-\theta(1-x)}, & x < 1. \qquad \text{(6.4c)}
\end{cases} $$

Giants have a total mass ~$m_t n_g(t)$, and one can quickly verify
that the mass ratio of giants to dwarfs is greatest in the case
x < 1, and is at most ~$\tau_g/t$ ~ 0.1; the contribution of giants
to the total mass will therefore be neglected. The integrated
luminosity of dwarfs is

$$ L_d(t) = \int_{m_L}^{m_t} n_d(m)\,\ell_d(m)\,dm = \frac{M_0\phi_1 m_1\ell_1}{\alpha - x}\left(\frac{t}{\tau_1}\right)^{-\theta(\alpha - x)}, \tag{6.5} $$

on the assumption $x < \alpha$, which is justified below. Finally, the
integrated luminosity of giants is

$$ L_g(t) = \ell_g\,n_g(t) = M_0\,\phi_1\,\theta\,m_1\,\ell_g\,\frac{\tau_g}{\tau_1}\left(\frac{t}{\tau_1}\right)^{-1+\theta x}. \tag{6.6} $$

The above relations will be used to derive some interesting
properties of this single generation of stars.
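
For readers who wish to experiment with these scalings, the Python sketch below evaluates the closed forms that follow from integrating Equations (6.1) and (6.3) with the mass-lifetime relation (5.16). The default values ($m_1 = 1\,M_\odot$, $\tau_1 = 10$ Gyr, lower mass limit 0.1 $M_\odot$, unit normalizations) are assumptions chosen for convenience, and the function names are hypothetical.

```python
import math


def dwarf_mass(t, x, m_low=0.1, m0_phi1=1.0, m1=1.0, tau1=10.0, theta=0.25):
    """Total mass of dwarfs, Eqs. (6.4a)-(6.4c), valid for m_low << turnoff."""
    m_t = m1 * (t / tau1) ** (-theta)
    norm = m0_phi1 * m1 ** 2
    if abs(x - 1.0) < 1e-12:
        return norm * math.log(m_t / m_low)
    if x > 1.0:
        return norm / (x - 1.0) * (m_low / m1) ** (1.0 - x)
    return norm / (1.0 - x) * (t / tau1) ** (-theta * (1.0 - x))


def dwarf_luminosity(t, x, alpha=5.0, ell1=1.0, m0_phi1=1.0, m1=1.0,
                     tau1=10.0, theta=0.25):
    """Integrated dwarf luminosity, Eq. (6.5); requires x < alpha."""
    return (m0_phi1 * m1 * ell1 / (alpha - x)
            * (t / tau1) ** (-theta * (alpha - x)))


def giant_luminosity(t, x, ell_g_tau_g=1.0, m0_phi1=1.0, m1=1.0,
                     tau1=10.0, theta=0.25):
    """Integrated giant luminosity, Eq. (6.6); ell_g_tau_g is ell_g * tau_g."""
    return (m0_phi1 * theta * m1 * ell_g_tau_g / tau1
            * (t / tau1) ** (-1.0 + theta * x))


if __name__ == "__main__":
    for x in (0.0, 1.0, 2.0):
        print(x, dwarf_mass(10.0, x), dwarf_luminosity(10.0, x),
              giant_luminosity(10.0, x))
```

The strong dependence of the dwarf mass on x for x > 1, where the result is set by the assumed lower mass limit, is apparent on varying `m_low`.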


6.2.2 Remnants of dead stars 


There may be a significant dark mass in the form of remnants
of stars initially above $m_t$, especially if the IMF has a
fairly shallow slope so these stars were relatively numerous.
Although it is probably a very poor approximation to extrapolate
the IMF to high masses with the slope x used near 1 $M_\odot$,
the equations will be written to show how the contributions
of remnants can be estimated in the simplest cases. (These
results can easily be modified to allow for a variable slope).
In this approximation, it will be assumed that all remnants
have the same mass $w$, and that all stars above $m_t$ are dead.
Then the total mass of remnants is $w$ times the number of
stars formed with masses between $m_t$ and the upper limit $m_U$:

$$ M_w(t) = w\int_{m_t}^{m_U} M_0\,\phi(m)\,dm = \frac{w M_0\phi_1 m_1}{x}\left(\frac{t}{\tau_1}\right)^{\theta x}, \tag{6.7} $$

assuming $m_U \gg m_t$ and x > 0. The relative mass of remnants is
potentially greatest if x < 1, and then Equation (6.4c) shows
that $M_w/M_d \sim w/m_t$, which could be close to unity. This result
is obviously strongly dependent on the assumption of a
single power law for the whole IMF, which would exagger- 
ate the mass of remnants if, for example, elliptical galaxies 
have a curved IMF like the function in the Solar neighbor- 
hood (Figure 4). It may be concluded that dead remnants 
could possibly affect the total mass by a factor ~2, which 
cannot be predicted with any confidence from constraints on 
the slope of the IMF at turnoff. 
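
The size of the remnant contribution in the single power-law case can be made explicit with a short Python sketch. Dividing Equation (6.7) by Equation (6.4c) and using Equation (5.16) gives $M_w/M_d = (w/m_t)(1-x)/x$ for 0 < x < 1. The remnant mass of 0.7 $M_\odot$ used as a default is an assumption borrowed from the white-dwarf value quoted earlier, and the function name is hypothetical.

```python
def remnant_to_dwarf_mass_ratio(x, w=0.7, m_t=1.0):
    """M_w / M_d for a single power-law IMF with 0 < x < 1.

    Follows from dividing Eq. (6.7) by Eq. (6.4c):
    M_w / M_d = (w / m_t) * (1 - x) / x.
    """
    if not 0.0 < x < 1.0:
        raise ValueError("this closed form assumes 0 < x < 1")
    return (w / m_t) * (1.0 - x) / x


if __name__ == "__main__":
    for x in (0.3, 0.5, 0.8):
        print(x, remnant_to_dwarf_mass_ratio(x))
```

For x near 0.5 the ratio is close to $w/m_t$, as stated above, but it varies steeply with x, which illustrates why the remnant mass cannot be predicted with confidence.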


6.2.3 The ratio of giants to dwarfs in the light 


Some spectral features in the integrated light of elliptical 
galaxies depend sensitively on the relative amounts of light 
contributed by giant and dwarf stars at the feature wavelength.
Examples are an iron hydride band at 0.99 μm, known
as the Wing-Ford band, which Whitford (1977) has found to
be extremely strong in late dwarfs but weak in late giants;
and a carbon monoxide band at 2.2 μm, studied especially by
Frogel et al. (1978, and earlier papers cited therein), which 
has the opposite behavior, being much stronger in late gi- 
ants than in late dwarfs. Since the light of elliptical galaxies 
at those wavelengths must be dominated by late-type stars, 
the galaxies should show a weak FeH band and a strong CO 
band if giants outshine dwarfs, and vice versa. As the fol- 
lowing analysis shows, the relative luminosities of giants and 
dwarfs give important information on the slope of the IMF, 
which in turn affects many other properties of elliptical galax- 
ies including the rate of evolution of total luminosity; it is the 
significance of this effect for cosmological tests (Section 6.2.6) 
that has motivated much of the analysis of spectral features. 

Equations (6.5) and (6.6) together give an approximate
expression for the relative luminosities of giants and dwarfs:

$$ G(t) = \frac{L_g(t)}{L_d(t)} = \theta(\alpha - x)\,\frac{\ell_g\tau_g}{\ell_1\tau_1}\left(\frac{t}{\tau_1}\right)^{-1+\theta\alpha}. \tag{6.8} $$

An alternative expression is obtained by substituting Equations
(5.16) and (6.3):

$$ G(t) = \theta(\alpha - x)\,\frac{\ell_g\tau_g}{\ell_d(m_t)\,t}. \tag{6.9} $$

The term $\ell_g\tau_g$ is the amount of energy radiated (at a given
wavelength) by a star of approximately turnoff mass after it
leaves the MS, while $\ell_d(m_t)\,t$ is the energy radiated during
MS evolution. Thus Equation (6.9) says that the value of G
in bolometric light is, in order of magnitude, equal to the
ratio of nuclear fuel consumed after leaving the MS to that
consumed on the MS; since stars near 1 $M_\odot$ burn the hydrogen
in only 10% of their mass while on the MS but in 70% before 
they die (Section 2.4.1), this fuel ratio is ~6. This high value is 
the underlying reason why giants can outshine dwarfs in the 
integrated light of a galaxy, despite their very short lifetimes. 
Giants tend to be especially dominant at long wavelengths, 
because most of the energy from the giant branch as a whole 
comes from red giants. 

The fuel burning ratio is not the only factor affecting G,
however. The term $(\alpha - x)$ in Equation (6.9) introduces a
dependence on x, the slope of the IMF. A larger value of x reduces
the contribution of giants simply by reducing the number
of stars in the mass range of giants (just above turnoff)
relative to those still on the MS. The dependence of G on x 
is of great practical importance, since it allows spectroscopic 
criteria to set constraints on x. The work of Whitford (1977) 
and Frogel et al. (1978) shows that the red—infrared light of 
elliptical galaxies is strongly dominated by giants, to an ex- 
tent that x must be less than 2, and possibly less than 1. 
These constraints are consistent with the IMF in the Solar 
neighborhood, which has x < 1 in the relevant mass range 
(Figure 4 and Equation 2.9). 
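
The dependence of G on x can be illustrated with a short numerical sketch. It assumes that Equation (6.9) has the form G = θ(α − x) × (energy radiated after the MS)/(energy radiated on the MS), takes the bolometric energy ratio to be the fuel ratio of ~6 quoted above, and adopts θ = 0.25 and α = 5 (the approximate values used in Section 6.2.4). The function name and default arguments are illustrative and are not part of the original analysis.

```python
def giant_to_dwarf_ratio(x, energy_ratio=6.0, theta=0.25, alpha=5.0):
    """Order-of-magnitude G = L_giants / L_dwarfs for IMF slope x.

    energy_ratio is (post-MS energy) / (MS energy) for a turnoff star;
    the default of 6 is the bolometric fuel ratio quoted in the text.
    theta and alpha are assumed power-law exponents (see lead-in).
    """
    return theta * (alpha - x) * energy_ratio


# Tabulate G for a range of IMF slopes.
for x in (0.0, 1.0, 2.0, 3.0):
    print(f"x = {x:.0f}: G ~ {giant_to_dwarf_ratio(x):.1f}")
```

Under these assumptions a steeper IMF (larger x) lowers G, which is why giant-dominated spectra place an upper limit on x.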

The infrared spectra of elliptical galaxies set constraints 
not only on the IMF but also on the relative numbers of 
M giants of different spectral types that populate the gi- 
ant branch. As noted in Section 6.1.2, these numbers are 
not firmly predicted by stellar evolution theory, so studies of 
galaxy spectra can add to an understanding of late stages in 
the lives of low-mass stars. This application of galaxy models 
is discussed by Faber (1977), Tinsley (1978b), and references 
therein. 


6.2.4 The stellar mass loss rate relative to luminosity 


An expression for the rate of mass loss from stars has been 
derived in Section 5.4.3, but Equation (5.17) is not in a useful 
form for comparing with observable quantities. It is possible 
to obtain a useful equation for the ejection rate per unit in- 
tegrated luminosity, because both quantities scale with the 
populations of stars near turnoff. 

From Equations (6.5) and (6.8), the total luminosity can 
be written 

$$L(t) = [1 + G(t)]\,L_d(t) = \frac{M_0 \phi_1 \ell_1}{\alpha - x}\,[1 + G(t)]\left(\frac{t}{\tau_1}\right)^{-\theta(\alpha - x)}. \tag{6.10}$$

Then, with Equation (5.17), the ratio of ejection rate to luminosity is

$$\frac{E(t)}{L(t)} = \theta(\alpha - x)\,\frac{m_t - w_m}{\ell_1 \tau_1}\,\frac{1}{1 + G(t)}\left(\frac{t}{\tau_1}\right)^{\theta\alpha - 1}, \tag{6.11}$$

which shows that the ratio depends only slowly on time. A
more useful relation for finding the present ratio is given by
substituting Equations (5.16) and (6.3) to eliminate $\ell_1$ and $\tau_1$,
with the result

$$\frac{E(t)}{L(t)} = \theta(\alpha - x)\,\frac{m_t - w_m}{\ell_d(m_t)\,t}\,\frac{1}{1 + G(t)}. \tag{6.12}$$


This ratio can be estimated for present-day ellipticals as
follows. From spectroscopic studies in blue light, $G \approx 1$ (the
value of G is greater in red or bolometric light); and approximate
values of the other quantities are $\alpha = 5$, $\theta = 0.25$,
$m_t = 1\,M_\odot$, $w_m = 0.7\,M_\odot$, $\ell_d = 1\,L_{B\odot}$, $t = 10$ Gyr, $x = 1$. The
result from Equation (6.12) is then $E/L_B = 0.015\,M_\odot\,L_{B\odot}^{-1}\,\mathrm{Gyr}^{-1}$,
of which the significance was discussed in Section 5.4.3.
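
The arithmetic behind this estimate can be checked with a minimal sketch. It assumes that Equation (6.12) has the form E/L = θ(α − x)(m_t − w_m)/[ℓ_d(m_t) t] × 1/(1 + G), which follows from the power-law relations used in this Section and reproduces the quoted value; the function and argument names are illustrative.

```python
def ejection_rate_per_luminosity(theta, alpha, x, m_t, w_m, ell_d, t, G):
    """E/L in Msun per L_Bsun per Gyr (assumed form of Equation 6.12).

    m_t, w_m : turnoff mass and remnant mass (Msun)
    ell_d    : luminosity of a turnoff dwarf (L_Bsun)
    t        : age of the population (Gyr)
    G        : ratio of giant to dwarf light in the same band
    """
    return theta * (alpha - x) * (m_t - w_m) / (ell_d * t) / (1.0 + G)


# Values adopted in the text for present-day ellipticals in blue light.
ratio = ejection_rate_per_luminosity(
    theta=0.25, alpha=5.0, x=1.0, m_t=1.0, w_m=0.7, ell_d=1.0, t=10.0, G=1.0
)
print(f"E/L_B ~ {ratio:.3f} Msun per L_Bsun per Gyr")
```

With the adopted values the factor θ(α − x) is unity, so the result is simply 0.3 M⊙ of ejecta per turnoff star, spread over 10 Gyr and shared between dwarf and giant light.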


6.2.5 The mass-to-luminosity ratio 


An analytical estimate of $M_s/L$ can be made using the mass of
stars $M_s \approx M_d(t)$ (neglecting the small contribution of giants
and the very uncertain contribution of dead remnants), and
the total luminosity L(t). From Equations (6.4a)-(6.4c), it is
clear that the result depends strongly on whether x is greater
or less than 1. Moreover, it depends critically on the assumption
that x is constant down to $m_L$, since the least massive stars
(or sub-stellar objects) can be numerous enough to dominate the
mass while contributing negligibly to the light. If x < 1, the
result from Equations (6.4c) and (6.10) is

$$\frac{M_s}{L} = \frac{\alpha - x}{1 - x}\,\frac{m_t}{\ell_d(m_t)}\,\frac{1}{1 + G(t)}, \qquad x < 1, \tag{6.13}$$

which is proportional to the mass-to-luminosity ratio of
turnoff stars. If x > 1 (or if x increases from a value below 1 at
turnoff to above 1 at smaller masses) Equation (6.4a) shows
that $M_s/L$ increases in proportion to $m_L^{-(x-1)}$, so it is sensitive
to a quantity that cannot be determined photometrically.
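
The sensitivity to the lower mass limit can be seen in a small sketch. It assumes a power-law IMF by number, φ(m) ∝ m^−(1+x), so that the mass in main-sequence stars is the integral of m^−x between the lower limit and the turnoff; the function name, the turnoff mass of 1 M⊙ and the two trial cutoffs are illustrative choices.

```python
import math


def dwarf_mass_integral(x, m_lower, m_turnoff):
    """Mass in MS stars between m_lower and m_turnoff, per unit IMF normalization.

    Assumes phi(m) proportional to m**-(1 + x), so the mass integrand
    is m * m**-(1 + x) = m**-x.
    """
    if math.isclose(x, 1.0):
        return math.log(m_turnoff / m_lower)
    return (m_turnoff ** (1.0 - x) - m_lower ** (1.0 - x)) / (1.0 - x)


# Effect of lowering the cutoff from 0.1 to 0.01 Msun, with turnoff at 1 Msun.
for x in (0.5, 2.0):
    gain = dwarf_mass_integral(x, 0.01, 1.0) / dwarf_mass_integral(x, 0.1, 1.0)
    print(f"x = {x}: stellar mass grows by a factor {gain:.1f}")
```

For a shallow slope the mass changes by only a few tens of percent, whereas for x = 2 it grows roughly in proportion to the inverse of the cutoff mass, while the light is almost unaffected in either case.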

In all cases, photometric data (star counts, population syn- 
theses, spectroscopic estimates of x) yield only a lower limit 
to the true mass-to-luminosity ratio (M/L) of a galaxy, since 
any amount of mass could be present in hidden form. When 
the masses of galaxies are determined dynamically, the em- 
pirical M/L values often increase to such large values in the 
outer regions that a large amount of hidden mass must indeed 
be present (e.g. Spinrad et al. 1978). For this reason, values 
of M/L determined from population syntheses and equiva- 
lent methods are sometimes called “photometric M/L ratios” 
to distinguish them from the ratios of actual mass (defined 
dynamically) to luminosity. 


6.2.6 Evolution of luminosity and the Hubble diagram 


In one of the classic cosmological tests, the Hubble diagram, 
logarithmic redshifts of galaxies are plotted against
their apparent magnitudes, as illustrated schematically in 
Figure 13. For a sample with a well-defined mean absolute 
magnitude, this diagram can be regarded heuristically as a 
plot of “recession velocity” versus “distance”. At small red- 
shifts, the regression line is linear with a slope corresponding
to Hubble’s Law, redshift ∝ distance. At large redshifts,
the deviation from linearity measures the change in the ratio 
“velocity” / “distance” with distance itself; since the lookback 
time (the light-travel time) increases with distance, the cur- 
vature of the Hubble diagram thus gives a measure of the past 

Figure 13. Schematic Hubble diagram showing how both the deceleration
parameter ($q_0$) and evolution of galaxies affect the departure
from linearity. Lines are schematic “theoretical” curves for
two values of $q_0$, dots are hypothetical data points, and arrows
indicate qualitatively how they should be corrected if the net effect
of evolution is to make distant galaxies intrinsically brighter than
nearby ones. More precisely, evolution (in this sense) at the rate
of a few percent of a galaxy’s luminosity per Gyr makes the true
value of $q_0$ smaller by about unity than the value inferred from the
uncorrected data points.


expansion rate of the Universe, and in particular of its deceleration.
The deceleration parameter $q_0$ can take only positive
values in the simplest cosmological models of General Relativity,
the Friedmann models, and 1/2 is a critical value: if
$q_0 > 1/2$, the deceleration is large enough for the expansion
eventually to be reversed, but if $0 < q_0 < 1/2$, the Universe
will expand forever; if in fact $q_0$ is negative, indicating that
the expansion is accelerating, more complicated cosmological
models are required. Evolution of galaxies enters the picture
because the lookback times sampled must be many Gyr for
the deceleration to be detectable; the galaxies then had significantly
different luminosities, so the “distance” parameter,
apparent magnitude, cannot be estimated on the assumption
of a constant absolute magnitude. The departure of the Hubble
diagram from linearity is very sensitive to evolution: if the
luminosities of elliptical galaxies grow fainter at a few percent
per Gyr, for example, the apparent value of $q_0$ (inferred from
the shape of the Hubble diagram) exceeds its true value by
several tenths. This problem has been discussed by Humason
et al. (1956), Sandage (1961b,c), Gunn & Oke (1975), and
Tinsley (1972b, 1977b). For an approximate estimate of the
evolutionary correction to $q_0$, the above analytical equations
can be used.
From Equation (6.10), we have

$$\frac{d(\ln L)}{d(\ln t)} = -\theta(\alpha - x) + \frac{t}{1 + G}\,\frac{dG}{dt}, \tag{6.14}$$

and Equation (6.8) can be used to evaluate dG/dt. The term
$\ell_g \tau_g$ in the expression for G(t) depends only slowly on time,
because giant branch evolution depends only weakly on mass
in the relevant range, so only the explicit time-dependence
need be considered and that term gives $(t/G)(dG/dt) = \theta\alpha - 1$.
Substituting in Equation (6.14), we have

$$\frac{d(\ln L)}{d(\ln t)} = -\theta(\alpha - x) + (\theta\alpha - 1)\,\frac{G}{1 + G}. \tag{6.15}$$

Figure 14. Evolution of colors and magnitudes of single-generation
models for the stellar population in elliptical galaxies (Tinsley &
Gunn 1976a). Curves are for three values of the slope of the IMF:
solid lines, x = 2; dashes, x = 1; dots, x = 0. Note that if x is small,
colors evolve slowly but magnitudes evolve quickly.


The second term in Equation (6.15) is not very important,
since $(\theta\alpha - 1)$ is a few tenths and G/(1 + G) lies between 0 and
1. The main term is therefore simply $(-\theta\alpha + \theta x)$, which can be
written

$$\frac{d(\ln L)}{d(\ln t)} \simeq -1.3 + 0.3x. \tag{6.16}$$

Essentially the same result is obtained for the evolution of
luminosity in numerical population models. The examples in
Figure 14 show the predicted dependence on the IMF: the
rate at which $M_V$ gets dimmer is slower in models with a
larger value of x. Since giants supply most of the light, this
behavior is mainly because, when x is large, the giant branch
is fed by a more richly populated main sequence as time goes
on.


In the Hubble diagram, evolution means that departures
from linearity are due not only to $q_0$ but also to systematic
changes in the absolute magnitudes of galaxies (Figure 13).
If the curvature is interpreted without regard to evolution,
the result is an apparent value of $q_0$ that differs from the true
value by

$$\Delta q_0 \equiv \text{apparent value} - \text{true value} \simeq -1.5\,\frac{d(\ln L)}{d(\ln t)} \tag{6.17}$$

(e.g. Tinsley 1977b). A first-order estimate, from Equation (6.16),
is therefore

$$\Delta q_0 \simeq 2.0 - 0.4x. \tag{6.18}$$

The slope of the IMF, x, emerges as the critical parameter. 
As discussed in Section 6.2.3, spectroscopic studies indicate 
that x < 2, and possibly x < 1. In the first case, $|\Delta q_0| \gtrsim 1$, and
in the second case, $|\Delta q_0| \gtrsim 1.5$. In either case, the correction
for evolution is big enough to make a qualitative difference 
to the type of cosmology inferred. 
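
The size of the correction for a few representative slopes can be tabulated directly from the first-order estimate of Equation (6.18), Δq₀ ≈ 2.0 − 0.4x. The sketch below is only an evaluation of that rounded formula; the function name and the choice of x values are illustrative.

```python
def delta_q0(x):
    """Apparent minus true q0 from the first-order estimate 2.0 - 0.4 x."""
    return 2.0 - 0.4 * x


# Slopes bracketing the spectroscopic limits, plus the steep case x = 5.
for x in (0.0, 1.0, 2.0, 5.0):
    print(f"x = {x:.0f}: apparent - true q0 ~ {delta_q0(x):.1f}")
```

For x between 1 and 2 the correction lies between about 1.2 and 1.6, and it vanishes only for a slope as steep as x = 5.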

Current estimates of the apparent value of $q_0$ range from
~0 (Gunn & Oke 1975) to ~1.5 (Kristian et al. 1978), the differences
being due to unknown sampling and observational
effects. Downward corrections of order unity are clearly important
in determining whether the true value of $q_0$ is greater
than 1/2, less than 1/2, or even negative.

A value of x = 5 would be needed to make stellar evolution
negligible in the Hubble diagram, but such a steep IMF would
make the late dwarfs dominate the infrared light to an extent
that is precluded by the giant-dominated spectra of elliptical
galaxies. A possible loophole is the following. Equation (6.16)
depends on the value of x for stars near turnoff, while the
infrared spectra depend on the ratio of giants to dwarfs of
types K and M; if the IMF were to turn over sharply between
~0.5 and 1 $M_\odot$ (i.e., having far fewer less massive stars than
the turnoff slope would predict), one could have both a steep
slope at turnoff and a very small contribution from dwarfs
to the infrared light. This idea is, of course, completely ad
hoc, since the IMF in the Solar neighborhood has x < 2 for
all masses < 10 $M_\odot$, and does not cut off above ~0.2 $M_\odot$. It is
therefore most reasonable to conclude that elliptical galaxies
have giant-dominated spectra because the IMF has a fairly
shallow slope at turnoff; if so, their luminosity evolves fast
enough to make the apparent value of $q_0$ exceed its true value
by 1 or more.

However, this is not all we need to know to unravel the Hub- 
ble diagram. The galaxies used for this test are the central 
cluster giants, which are believed to grow secularly by can- 
nibalizing their neighbors (Section 5.3.3). This process could 
plausibly lead to a growth rate in the total stellar population 
of several percent per Gyr, with a corresponding increase of 
luminosity in opposition to the effect just discussed. The dy- 
namical effects cannot yet be calculated accurately enough 
for a correction to be applied to the Hubble diagram, so this 
test does not yet give a usefully accurate value of $q_0$. The
situation is reviewed by Tinsley (1977b). 


6.2.7 Evolution of colors 


Predictions of color evolution are of interest because they can 
be tested by observations of distant elliptical galaxies whose 
ages are several Gyr younger than nearby galaxies. 

The colors of a single-generation population become red- 
der with age, if the main course of stellar evolution is peeling 
off the MS at turnoff and following the red giant branch. 
The main contribution to the color change is the redward 
evolution of the turnoff, since giant evolution is insensitive 
to turnoff mass in the range of interest. Consequently, col- 
ors evolve faster if the light is less giant-dominated, i.e. if x 
is larger, in contrast to the integrated luminosities just dis- 
cussed. This behavior is illustrated in Figure 14. 

Qualitatively different behavior is predicted if the stars can 
lose enough mass to become blue horizontal-branch stars, in- 
stead of the red “clump” giants that normally represent the 
core helium burning stage of metal-rich low-mass stars (Sec- 
tion 2.4.1). It has been suggested that such stars lose mass at 
a variety of rates, some becoming late red giants and others 
becoming blue. Numerical models for galaxy populations in 
which mass loss occurs stochastically on the red giant branch 
have been studied by Ciardullo & Demarque (1978). Because 
evolution to a blue position in the HR diagram occurs only 
if the star has a small mass of envelope left, the fraction of 
giants becoming blue increases as the turnoff mass decreases. 
The upshot is that the integrated colors of the model galaxies 
evolve blueward after ~8 Gyr. 

Observations of distant elliptical galaxies are ambiguous 
on this point, as reviewed by Spinrad (1977). Some of the 
most distant central cluster galaxies known, with redshifts 
~0.6, have intrinsic colors that are bluer than those of nearby
ellipticals, but the distant galaxies were selected on the ba- 
sis of strong radio emission so they may be atypical. If they 
are typical, the color change is about that expected accord- 
ing to the type of models that evolve monotonically toward 
redder colors (e.g. Figure 14); the lookback time sampled is 
~4-7 Gyr, depending on the cosmological model. Another
sample of central cluster galaxies with redshifts up to nearly 
0.5 has no systematic dependence of color on redshift that 
can be disentangled from the intrinsic scatter (Wilkinson & 
Oke 1978). 

Dramatic color differences between nearby and distant 
galaxy populations in clusters have been discovered by 
Butcher & Oemler (1978a,b). Nearby clusters, i.e. those with 
lookback times < 1 Gyr, have galaxy populations that are 
strongly correlated with the cluster morphology: loose, ir- 
regular clusters have a large fraction of spiral galaxies, and 
centrally concentrated, regular clusters have very few spirals 
and mainly S0 and elliptical galaxies; the brighter galaxies 
in regular clusters are correspondingly all red. However, in 
two regular clusters with lookback times ~5 Gyr, the bright 
galaxies are found to have a wide range of colors, including 
many that are as blue as late-type spiral galaxies. On the as- 
sumption that the distant regular clusters represent younger 
versions of the nearby ones, these very blue galaxies must 
evolve (in a few Gyr) into red S0s or ellipticals. The color
change observed is many times greater than any predictions 
based on the evolution of single-generation populations, so it 
is concluded that those galaxies were actively forming stars 
just a few Gyr ago. Presumably, they are mainly the precursors
of S0 galaxies seen in nearby clusters, in which star
formation is undetectable. 


The stellar populations in most galaxies are far more complicated
than those in ellipticals, because young stars are important
contributors to the light. The time-dependence of the
SFR is therefore an important parameter in addition to the 
three quantities (age, IMF, and metallicity) used to charac- 
terize old populations, and the latter quantities could also be 
changing in time. Moreover, the colors of spiral and irregular 
galaxies are often affected by internal reddening and gaseous 
emission lines. Despite the complications presented by these 
galaxies, it is especially interesting to try to understand their 
photometric properties in terms of histories of star formation. 
Applications of such studies include explaining correlations 
between form and photometric properties, finding what phys- 
ical conditions are conducive to star formation, and searching 
for young galaxies. 

Models for galaxies with ongoing star formation are usually 
numerical; analytical approximations are cumbersome except 
in the simplest case of a constant SFR (Tinsley 1973). This 
Section will consider models that study only a few simple 
properties, mainly just UBV colors. Although more can be 
learned from spectroscopic details, the UBV system has the 
advantage of an extensive and homogeneous compilation of 
galaxy colors in the Second Reference Catalogue of Bright 
Galaxies (de Vaucouleurs et al. 1976; to be referred to as 
RC2). 


7.1 UBV Colors of Normal Galaxies


The UBV colors of a sample of morphologically normal galax- 
ies are shown in Figure 15; the crosses are all elliptical and S0 
galaxies, and the dots are a variety of morphological types, 
which we consider first. The colors of these galaxies form 
such a narrow distribution in the two-color diagram that it is 
tempting to look for one dominant parameter that could vary 
among galaxies and lead to a one-dimensional set of colors. 
Because the appearance and spectra of galaxies suggest a pro- 
gression of star-forming activity, ranging from very active in 
late-type irregulars to negligible in ellipticals, it is natural to 
suggest that the color sequence is due to different proportions 
of young and old stars. Population syntheses and evolutionary 
models have confirmed this view, and their conclusions can 
be summarized (with some oversimplification) in a “standard 
scenario” for galaxy evolution: normal galaxies have the same 
IMF and mean stellar metallicities, and they are of the same 
age, but they differ in the time-dependence of their SFRs; in 
particular, the latest (bluest) types of galaxies form stars on 
a long timescale, while the earliest (reddest) ceased star for- 
mation long ago. This hypothesis is obviously inaccurate in 
detail, but it provides a useful starting point. It will be used 
to construct a series of “standard” galaxy models, whose col- 
ors will be compared with observations, and then the effects 
of factors other than the SFR will be considered in turn. 


7.1.1 “Standard” models 


The methods discussed in Section 6.1.2 have been used by 
various authors to construct models corresponding to the 
standard scenario; the (typical) results shown here are from 
Larson & Tinsley (1978). Let us consider models with the 
IMF of the Solar neighborhood, Solar (or old-disk) metal- 
licity, and an age of 10 Gyr; the exact choice of these stan- 
dard parameters is not critical, as shown below. The models 
have monotonically decreasing SFRs ($\psi$), ranging from constant
to a single initial burst lasting $10^7$ yr. Different series of
models have different monotonic functions $\psi(t)$ between these
extremes, such as exponential functions, negative powers of 
time, and combinations of the constant and single-burst mod- 
els as two components in different proportions. The colors of 
these series in the UBV diagram all lie very near the locus 
indicated by a single heavy line in Figure 16: the model with 
a constant SFR is at the top of this line, that with an initial 
burst is at the bottom, and the form of the curve in between 
is essentially the same for all series with various functional 
forms for $\psi(t)$. The theoretical locus for these standard models
is very close to the observed mean locus for galaxies of
different types (line in Figure 15), so the standard scenario 
is at least superficially consistent. Two conclusions can be 
stated. 


(i) The UBV colors of normal galaxies can in general be 
accounted for by models with the same age, metallicity, and 
IMF; most of the observed colors lie in the range predicted 
for monotonically decreasing SFRs, within the observational 
errors. A further conclusion is that late-type galaxies are not 
necessarily young, even though their appearance and blue- 
light spectra are dominated by short-lived OB stars; the in- 
tegrated colors are instead consistent with an underlying pop- 
ulation of stars with ages up to many billions of years. These 
conclusions have been stressed in the context of evolutionary 

Figure 15. Two-color diagram for morphologically normal galaxies
and globular clusters. Filled circles: galaxies from the Hubble Atlas
(Sandage 1961a), excluding peculiars and those with galactic latitudes
|b| < 20°, with corrected colors from the RC2; the error cross
is for this sample, and the solid line is its mean locus estimated
by eye (Larson & Tinsley 1978). Crosses: E and S0 galaxies in
the Virgo cluster, with colors from Sandage (1972), corrected for
reddening according to the RC2 formulae. Open circles: galactic
globular clusters (excluding those with $E_{B-V} > 0.05$), with colors
from Harris & Racine (1979), corrected for reddening according to
the RC2 formulae.


models by Tinsley (1968), Searle et al. (1973), Larson & Tins- 
ley (1974, 1978), and Huchra (1977). Caveats and deviations 
from the norm are discussed below. 


(ii) Models with monotonically declining SFRs (and with 
the same age, metallicity, and IMF) define a one-parameter 
sequence in the (U-B, B-V) plane. Inspection of the models 
shows that the parameter is the ratio of the present SFR to its 
average value over the lifetime of the system; or equivalently 
the SFR per unit mass of stars ($\psi_1/M_s$ or $\psi_1/\bar{\psi}_1 t_1$); or equivalently
the inverse of these quantities, a timescale for star
formation $T_1 = \bar{\psi}_1 t_1/\psi_1$, in the notation of Section 2.2. Galaxies
of the latest morphological types are the bluest objects
in Figure 15, and these evidently have the longest timescales 
for star formation, while the earliest types, which are the red- 
dest, have the shortest timescales; this point will be discussed 
further in Section 7.1.6. 
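
The one-parameter idea in conclusion (ii) can be illustrated with a small sketch that evaluates the ratio of the present SFR to its lifetime average for two of the functional forms mentioned above. The closed-form expressions are simple integrals of the assumed histories, ignoring the finite duration of the burst; the function names, the age of 10 Gyr and the e-folding time of 5 Gyr are illustrative choices.

```python
import math


def ratio_exponential(age, tau):
    """Present/mean SFR for psi(t) proportional to exp(-t / tau), at t = age."""
    u = age / tau
    return u / math.expm1(u)


def ratio_burst_plus_constant(burst_fraction):
    """Present/mean SFR when a fraction of all stars formed in a brief
    initial burst and the remainder at a constant rate ever since."""
    return 1.0 - burst_fraction


age = 10.0  # Gyr, the standard age adopted in the text
b = ratio_exponential(age, tau=5.0)
print(f"exponential, tau = 5 Gyr: present/mean SFR = {b:.2f}")
print(f"same ratio from a two-component model with burst fraction {1.0 - b:.2f}")
```

According to the models described in the text, two such histories with the same ratio would lie at nearly the same point on the UBV locus, even though their detailed star formation histories differ.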


The one-parameter sequence shows that UBV colors for a
given value of $T_1$ are almost independent of the functional
form of $\psi(t)$, as long as it is monotonic. An unfortunate consequence
of this result is that the UBV colors of a galaxy (if
near the mean locus in Figure 15) cannot give any more information
about the SFR than the quantity $T_1$. In practice, they
give less information, because of ambiguities due to possible
variations of metallicity, etc., as shown below.

The sensitivity of colors to the single parameter $T_1$ is due to

Figure 16. Theoretical two-color diagram for galaxies with monotonic
SFRs, Solar metallicity, and the local IMF (Section 7.1).
Heavy line: “standard” models, i.e. those of age 10 Gyr, with SFRs
ranging from constant at the top to a single initial burst at the
bottom. Light solid lines: models differing from the standard set
only in age, as indicated. Dashes: models differing from the standard
set only in having an IMF with a constant slope, as marked.
Dash-dot line: models differing from the preceding set with x = 1
only in having an upper stellar mass limit $m_U = 10\,M_\odot$, whereas all
other models shown have $m_U = 30\,M_\odot$. Arrows: approximate estimates
of the effect on colors of blue and red galaxies, respectively,
of altering the metallicity by the factor indicated.


the dominance of low-mass giants and/or young OB stars in 
the light of galaxies, as discussed in Section 6.1.2. In effect, 
the contribution of low-mass giants is proportional to the 
number of long-lived stars ever formed, and the contribution 
of upper-main-sequence stars is proportional to the present 
SFR, so their ratio is proportional to $T_1$. The integrated colors
are insensitive to details of the past SFR because A-F dwarfs
with intermediate lifetimes contribute relatively little light, 
and because the nature of the giant branch changes little 
over a wide range of MS lifetimes for the precursor stars. For 
the same reasons, it is difficult to extract significantly more 
information about the history of star formation in a galaxy 
from more detailed photometry than from UBV colors. 


We next consider some possible problems with the simple 
one-parameter scenario. 


7.1.2 Possible effects of errors 


Three systematic discrepancies between the models and data 
can be seen on comparing Figures 16 and 15: the heavy theoretical
line lies about 0.05 mag above the empirical mean
locus, some galaxies are bluer than the bluest model, and 
some are redder than the reddest model. The systematic off- 
set is no more than could be due to errors in the stellar evolu- 
tion tracks, judged from series of models based on alternative 
tracks. If this offset is corrected ad hoc by moving the heavy 
theoretical line downward, there are still some bluer and redder
galaxies than predicted. The differences seem to be too
big to ascribe to uncertainties in the stellar evolution used, 
and they cannot be corrected by redefining the “standard” 
age or metallicity, since an improvement at the red end would 
leave more discrepant galaxies at the blue end, and vice versa. 
Nor can the “standard” IMF be changed, since if the IMF is 
universal it must be the same as the local function. There- 
fore, not all of the discrepancies between the heavy line and 
the data are due to theoretical errors within the framework 
of the standard scenario. 

Although the mean error bar shown for the data in Fig- 
ure 15 is small, some of the colors may have significantly 
larger errors due to uncertainties in the reduction. The colors 
plotted were corrected in the RC2 on a statistical basis for 
Galactic and internal reddening, so excessively red and blue 
galaxies could result from inappropriate corrections in a few 
cases. A reddening vector (from the RC2) is shown in Fig- 
ure 15, and it indicates that galaxies away from the ends of 
the distribution could not be moved far from the mean locus 
except by extremely large over- or underestimates of their 
reddening, because the vector happens to lie almost parallel 
to the mean locus itself. 

Emission lines can affect the colors of late-type galax- 
ies, but the estimates made by Huchra (1977) indicate that 
morphologically normal galaxies are unlikely to have strong 
enough gaseous emission for this to be important. 


In summary, it seems likely that some normal galaxies have 
colors that are too red or too blue to be accounted for by 
the standard scenario. Two questions arise: How can the dis- 
crepant galaxies be accounted for? And could normal galaxies 
have significant variations in age, IMF, or metallicity that do 
not show up on the UBV plot? 


7.1.3 Variations in age


Light lines in Figure 16 indicate the effects of allowing ages 
between 5 and 20 Gyr. The loci for different ages overlap, so 
most of the galaxies in Figure 15 could have any ages in this 
range. The extreme colors, however, do depend on age, and 
the bluest and reddest data points could be accounted for if 
the ages of galaxies vary by a factor ~4. We shall see that 
this is not the only possible explanation of those data points, 
since metallicity effects are probably important. 


7.1.4 Variations in metallicity 


Effects of different stellar metallicities ($Z_s$) can be estimated
as outlined in Section 6.1.2, and some approximate results are 
indicated in Figure 16; the slope of the vector for red galaxies 
is empirical, but the slope for blue galaxies and the length of 
each vector are uncertain by factors ~2. 

As discussed in Section 5.1, the sequence of colors for E-S0
galaxies (crosses in Figure 15) is regarded as one of metallicity.
This sequence closely overlaps the locus of galaxies with
different SFRs and ages, so UBV colors alone cannot unambiguously
give the SFR parameter ($T_1$), age, and $Z_s$ for a
population of stars. The reddest points in Figure 15 are giant
elliptical galaxies, which almost certainly have a mean $Z_s$
greater than Solar (in the aperture used for the colors); if the
galaxies have some residual star formation, it is undetected
to date, but it could affect the colors enough to change the
estimated $Z_s$ and/or age somewhat.

The bluest points in Figure 15 are small late-type galaxies. 
It is known from studies of the gaseous emission lines in some 
such galaxies that they can be significantly metal-poor (e.g. a 
factor of 4 in the Small Magellanic Cloud; Pagel et al. 1978), 
so a low $Z_s$ could help to make these points very blue. The
effects of abundance changes on the colors of blue galaxies 
are too uncertain to say whether another effect, such as a 
somewhat younger age, is also required to account for their 
being bluer than the standard sequence. 


7.1.5 Variations in the IMF 


To show possible effects of variations in the IMF, Figure 16
includes the loci of UBV colors of models differing from the
standard sequence only in their IMF. Two of the variants
have constant slopes, x = 1 and x = 2, while the third has
x = 1 and an upper limit $m_U = 10\,M_\odot$ compared to $30\,M_\odot$ in
all other cases (see footnote 11). For blue galaxies, the local IMF
gives colors between those for x = 1 and x = 2. In general, the colors
are redder with a larger value of x or a smaller value of $m_U$,
since there are then relatively few upper-MS stars. Comparisons
with Figure 15 show that the variants illustrated are about
the largest deviations from the local IMF that one could have
without predicting a greater color spread than is observed.
This conclusion applies only to the bluer galaxies, and only
to stars $\gtrsim 1\,M_\odot$ that contribute significantly to their light. It is
clear from Figure 16 that the UBV colors of redder galaxies
($B-V \gtrsim 0.8$) are very insensitive to the variations of IMF
considered. Additional information on the IMF, derived from
spectroscopic studies and M/L, was discussed in Section 2.2.2.
Possible departures from the local IMF discussed there are
a lack of stars above $10\,M_\odot$ in some early-type spirals, and
ubiquitous variations in the fraction of very low-mass objects.
In elliptical galaxies, the IMF between ~0.4 and 1 $M_\odot$ cannot
be very much steeper than the local function (Section 6.2.3).


7.1.6 Relevance to the formation and structure of normal 
galaxies 


To summarize the preceding discussion, every aspect of the
standard scenario has been shown to have its weaknesses: the
metallicity, IMF, and age are known to vary from one galaxy
to another and/or could vary significantly without affecting
the locus of normal UBV colors. Nevertheless, it is true that
the main parameter causing the progression of colors of morphologically
normal galaxies is the timescale for star formation.
The average UBV colors of galaxies of different Hubble
types lie along the middle of the distribution in Figure 15,
with the latest types at the top and the earliest at the bottom
(de Vaucouleurs 1977). Thus there is a strong correlation
between the structure of galaxies and their timescales for star
formation. Star formation seems to be most efficient in the
galaxies with the highest bulge-to-disk ratios, and, among the
spirals, it is most efficient in those with the tightly wound spiral
arms.


11 The models in Figures 16 and 18 use an IMF slightly different
from Equation (2.9): the upper limit is $30\,M_\odot$ (except as stated),
and the slope is x = 1.3 ($\phi \propto m^{-2.3}$) for all $m > 2\,M_\odot$. The UBV colors
would be little affected if Equation (2.9) itself, with $m_U$ taking any
value > $30\,M_\odot$, were used (cf. Huchra 1977).


An obvious question is whether the shape of a galaxy is a 
consequence of its efficiency of star formation, or whether, 
conversely, the timescale for star formation is determined 
by the structure. Both effects are believed to be present. 
On one hand, more efficient early star formation leads to 
a galaxy with a greater bulge-to-disk ratio (Section 1.2): the 
formation of a spheroidal component requires that the stars 
in this component formed on a timescale less than that for 
gaseous dissipation in the protogalaxy. On the other hand, 
the present structure of a galaxy governs its large-scale dy- 
namics, which in turn has important effects on star formation 
(Section 2.3.3); for example, a more prominent bulge implies 
a greater surface density, which can lead to stronger gas com- 
pression and so to more efficient star formation in the disk. 
The papers cited in Section 1.2 and Section 2.3.3 give many 
more detailed discussions, and further references, on the ori- 
gin of the Hubble sequence of galaxy types and the numerous 
properties of galaxies that correlate with their forms. 

S0 galaxies, which are disk galaxies without spiral struc- 
ture, are an inscrutable class. Like elliptical galaxies, they 
have colors consistent with no ongoing star formation (or 
possibly a little, showing at short wavelengths; Section 6.1.1). 
There are currently two types of theory on the origin of S0 
galaxies: in the first, they are former spirals that have no 
more star formation because their ISM was lost, either in 
a collision with another galaxy or by ram-pressure sweeping 
due to motion through an ambient IGM; in the second type of 
theory, S0 galaxies had intrinsically very efficient star forma- 
tion in their disks at early stages. The second type of theory 
is preferred by some authors because S0 galaxies in isolation 
and in dense clusters have essentially the same low contents 
of neutral hydrogen (Faber & Gallagher 1976) and the same 
distributions of colors (Sandage & Visvanathan 1978), and 
because there are some structural differences between S0s and
spirals (Burstein 1978). Nevertheless, the first picture is
supported circumstantially by the high proportion of S0 galaxies
in clusters, especially in their central regions, and especially 
in clusters with hot IGM (Oemler 1974; Melnick & Sargent 
1977). Arguments based on colors are not decisive, because 
models in which star formation stopped long ago have sim- 
ilar present colors to those in which it stopped only a few 
Gyr ago (Biermann & Tinsley 1975). Whatever mechanism 
cuts off star formation, the very blue galaxy content of distant,
regular clusters strongly suggests that many S0 galaxies
were actively making stars only ~4 Gyr ago (Section 6.2.7). 
It is especially interesting that some nearby clusters contain 
“anemic” spirals, with weak spiral structure and a subnormal 
neutral hydrogen content, that have been interpreted as disk 
systems at a stage of evolution between normal spirals and 
stripped S0s (van den Bergh 1976b).


7.2 Colors of Peculiar Galaxies 


Galaxies with morphological peculiarities have peculiar colors 
too, as illustrated in Figure 17, which is a UBV diagram for 
systems in the Atlas of Peculiar Galaxies (Arp 1966). The 
width of their color distribution is in striking contrast to the 
narrow locus of normal galaxies (Figure 15), and on closer 
inspection the width turns out to be due almost entirely to 
interacting galaxies, which are shown as crosses in Figure 17
(Larson & Tinsley 1978). The implication is that dynamical
disturbances have led to an unusual star formation history,
so a study of these galaxies and their colors might shed some
light on the process of star formation in general.

Figure 17. Two-color diagram for galaxies in the Atlas of Peculiar
Galaxies (Arp 1966), excluding those with galactic latitudes |b| <
20°, with corrected colors from the RC2 and other sources cited
by Larson & Tinsley (1978). Crosses denote interacting systems.
Open circles are two Type I Seyfert galaxies, whose colors may be
affected by non-thermal emission.


7.2.1 Bursts of star formation and blue colors 


If the SFR in a galaxy does not decrease monotonically, colors 
very different from those of the standard models in Section 7.1 
can be obtained. The idea of star formation in “flashes” or 
“bursts” was introduced by Searle & Sargent (1972) (see also 
Searle et al. 1973) to explain the very blue colors of some 
dwarf irregular galaxies, and it has appeared in other con- 
texts including elliptical galaxies with patches of star forma- 
tion (van den Bergh 1975) and the peculiar galaxies discussed 
here. 


Figure 18. Theoretical two-color diagram showing the colors of
young galaxies (dotted and heavy dashed lines) and an old red
galaxy with bursts of star formation of various strengths and ages
(light solid and dashed lines). Details of these lines and their labels
are explained in Section 7.2.1. The heavy solid line is the locus of
standard old models, from Figure 16.

The effects of a burst of star formation on a formerly red
galaxy are illustrated in Figure 18, where the heavy curve is
the locus of standard models aged 10 Gyr, from Figure 16. The
colors of younger galaxies are shown in two extreme cases: the
dotted line is a model with a constant SFR, evolving through
the ages shown (in Gyr), and the heavy dashed line (on the
left) is a model whose star formation stopped at $10^7$ yr. The
latter curve can be regarded as the evolution of a cluster of
stars formed in a period of $10^7$ yr, or equivalently as the colors
resulting from stars formed in a burst lasting $10^7$ yr. The light
solid lines are the loci of models made of two components:


(i) a red galaxy aged 10 Gyr, with no ongoing star forma- 
tion; and 

(ii) stars formed in a burst of duration $10^7$ yr, seen at ages
$10^7$ yr (upper light solid line) and $10^8$ yr (lower light solid line).


The numbers along the upper curve are a burst strength 
parameter, defined as the mass ratio of stars formed in the 
burst to stars in the old red galaxy. Finally, the light dashed 
lines represent the evolution of a composite system, from age
$10^7$ yr when the star formation in the burst stops; these are
lines of constant burst strength, and they cross the lower light
solid line when the age is $10^8$ yr. (The heavy dashed line is
the limiting case of a burst of infinite strength, i.e., without
any underlying old stars.) It can be seen that a burst of star
formation in a red galaxy gives colors initially above the nor- 
mal UBV locus, since the young stars cause an “ultraviolet 
excess”; as the burst ages, the colors evolve across the normal
locus after nearly $10^8$ yr; then they fall below the normal
line and eventually become imperceptibly different from the 
colors of an undisturbed old galaxy. 
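
The two-component construction described above can be sketched numerically. The following is a minimal illustration of the arithmetic only, not the procedure used to compute Figure 18: the light of an old population and of a burst is added band by band and converted back to colors. The function names, the V-band mass-to-light ratios, and all input colors are hypothetical placeholders, not values from the models of Larson & Tinsley (1978).

```python
import math

# Hypothetical placeholder inputs (NOT taken from the source models):
# colors in magnitudes, V-band mass-to-light ratio in solar units.
OLD = {"ub": 0.50, "bv": 0.95, "ml_v": 5.0}      # old red galaxy
BURST = {"ub": -1.00, "bv": -0.20, "ml_v": 0.02}  # young burst population


def band_fluxes(ub, bv, lum_v):
    """Relative U, B, V fluxes of one component with V luminosity lum_v."""
    f_v = lum_v
    f_b = f_v * 10.0 ** (-0.4 * bv)   # B - V = -2.5 log10(f_B / f_V)
    f_u = f_b * 10.0 ** (-0.4 * ub)   # U - B = -2.5 log10(f_U / f_B)
    return f_u, f_b, f_v


def composite_colors(old, burst, strength):
    """Return (U-B, B-V) of old stars plus a burst.

    strength is the burst strength parameter: the mass ratio of stars
    formed in the burst to stars in the old galaxy.
    """
    # V light per unit mass of old stars, and of the burst scaled by strength.
    lum_old = 1.0 / old["ml_v"]
    lum_burst = strength / burst["ml_v"]
    u1, b1, v1 = band_fluxes(old["ub"], old["bv"], lum_old)
    u2, b2, v2 = band_fluxes(burst["ub"], burst["bv"], lum_burst)
    u, b, v = u1 + u2, b1 + b2, v1 + v2
    return -2.5 * math.log10(u / b), -2.5 * math.log10(b / v)


if __name__ == "__main__":
    for s in (0.0, 0.001, 0.01, 0.1):
        ub, bv = composite_colors(OLD, BURST, s)
        print(f"strength={s:<6} U-B={ub:+.2f} B-V={bv:+.2f}")
```

With any such inputs, a strength of zero returns the colors of the old galaxy, and increasing strength moves the composite toward the colors of the burst alone, because the young stars have a far lower mass-to-light ratio and so dominate the light even at small mass fractions.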

Evidently the colors of peculiar galaxies in Figure 17 can be 
explained by bursts of star formation of various strengths and
ages in galaxies with various initial colors. Using a series of 
models like those in Figure 18, Larson & Tinsley (1978) found 
that most of the Arp (1966) galaxies can be accounted for 
with bursts of strength less than 5% and duration < $2 \times 10^7$ yr;
a few more deviant colors are probably due to observational 
scatter, internal reddening, non-thermal continuum emission 
(in two Type I Seyfert galaxies in the sample), and strong 
gaseous emission lines, whose effects on colors are discussed 
by Huchra (1977). The strongest bursts of star formation 
are inferred for galaxies in distorted close pairs, often with 
bridges or tails, and in apparently single systems with tails 
and filamentary streamers. Dynamical models for colliding 
galaxies predict features like these in cases of strong tidal de- 
formation and recent mergers (Toomre 1977), so it appears 
that violent dynamical interactions lead to star formation. 
For this reason, it has been suggested that the stars in ellip- 
tical galaxies could have formed in bursts due to collisions 
and mergers among protogalactic subsystems (Section 5.3.2). 

Figure 18 shows that the colors of models with bursts of 
strength 10% and infinity are very similar. The differences 
are less than the observational uncertainties for many faint 
peculiar galaxies, and the theoretical uncertainties in stellar 
evolution, metallicity effects, etc. In other words, it is not 
possible to tell from UBV colors alone whether a galaxy is 
really young or has 90% of its mass in old stars! Some of 
the galaxies near the upper left in Figure 17 are very chaotic 
in appearance, and are tempting candidates for truly young 
galaxies, but in all cases the colors are inconclusive and the 
appearance could be due to a violent collision or irregularly 
distributed star formation in an old galaxy. 

Colors at longer wavelengths, such as $V - K$, are much
more sensitive than $B - V$ to the presence of some old stars
that would distinguish between a truly young system and one 
with a burst strength ~10% (Struck-Marcell & Tinsley 1978). 
Galaxies with very active star formation are often dusty, so 
red colors could be due to reddening rather than to age; on 
the other hand, very blue values of $V - K$ would indicate a
lack of red giants, and much stronger limits could be put on 
the mass of any underlying old component. As yet, there are 
no known nearby galaxies in which the dominant young stel- 
lar component could not be masking a significant mass of old 
stars. 


7.2.2 Highly reddened galaxies 


Some regions of galaxies that are suspected of having intense 
star formation are extremely dusty, and they show thermal 
infrared (IR) emission that is interpreted as re-radiation of 
starlight by the dust. Examples of such regions include the
centers of M82 and NGC 253, and the dust band around 
NGC 5128 (e.g. Kleinmann 1977; Telesco 1978). Star forma- 
tion is indicated by early-type spectra and blue colors in un- 
obscured patches (e.g. van den Bergh 1971, 1978), emission 
from interstellar molecules (e.g. Whiteoak 1978), and the lack 
of more plausible explanations for the IR emission (e.g. Kleinmann
1977). In NGC 5128, the dust band has an IR luminosity
of a few times $10^{10}\,L_\odot$ (Telesco 1978), which rivals the
visual luminosity of the entire elliptical galaxy. 

If the IR luminosity is assumed to represent the bolometric 
luminosity of buried stars, an SFR can be estimated (Struck- 
Marcell & Tinsley 1978): models like those of Section 7.1 show 
that any system with a mass-to-luminosity ratio $M_s/L_{\rm bol} < 0.5$
is so dominated by young stars that $L_{\rm bol}$ is almost directly
proportional to the SFR; with the local IMF, the relation is

$$\psi = (0.1\text{–}0.4)\,\frac{L_{\rm bol}}{L_\odot}\;M_\odot\ {\rm Gyr}^{-1}. \qquad (7.1)$$

An upper limit to the time for which star formation could
have continued at this rate is approximately $M_s/\psi$, so the
limiting timescale $\tau_s$ depends only on $M_s/L_{\rm bol}$, according to
the relation

$$\tau_s = (0.1\text{–}0.4)^{-1}\,\frac{M_s/L_{\rm bol}}{M_\odot/L_\odot}\ {\rm Gyr}.$$


Some galactic nuclei have such strong IR emission that 
$M_s/L_{\rm bol}$ is only a few hundredths, so the timescale is only
a few times $10^8$ yr. The dust-band region of NGC 5128 is
also making stars at a prodigious rate: given a luminosity
~$10^{10}\,L_\odot$, Equation (7.1) leads to an SFR of about 2 $M_\odot$ yr$^{-1}$,
so a respectable disk of stars could be built in just a few times 
the dynamical timescale of the system. van den Bergh (1975) 
has suggested that NGC 5128 could evolve into an early-type 
spiral seen edge-on, like the Sombrero galaxy M104. 
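
The arithmetic behind these estimates can be checked with a short sketch. It assumes that Equation (7.1) reads $\psi = (0.1\text{–}0.4)\,(L_{\rm bol}/L_\odot)\,M_\odot\,{\rm Gyr}^{-1}$ and that the limiting timescale is $M_s/\psi$, as the surrounding text indicates; the function names and the way the coefficient range is passed are illustrative choices, not part of the original work.

```python
# Assumed coefficient range of Equation (7.1), in M_sun per Gyr per L_sun.
COEFF_RANGE = (0.1, 0.4)


def sfr_from_lbol(l_bol_lsun, coeff=COEFF_RANGE):
    """Return (low, high) star formation rate in M_sun/yr.

    psi = coeff * (L_bol / L_sun) M_sun/Gyr, converted to M_sun/yr.
    """
    low, high = coeff
    return low * l_bol_lsun / 1e9, high * l_bol_lsun / 1e9


def limiting_timescale_gyr(ms_over_lbol, coeff=COEFF_RANGE):
    """Return (shortest, longest) limiting timescale M_s/psi in Gyr.

    ms_over_lbol is the mass-to-luminosity ratio M_s/L_bol in solar units.
    """
    low, high = coeff
    return ms_over_lbol / high, ms_over_lbol / low


if __name__ == "__main__":
    # Dust band of NGC 5128, taking L_bol ~ 1e10 L_sun as in the text.
    print("SFR range (M_sun/yr):", sfr_from_lbol(1e10))
    # A nucleus with M_s/L_bol of a few hundredths (0.03 is an example value).
    print("Timescale range (Gyr):", limiting_timescale_gyr(0.03))
```

For a luminosity of $10^{10}\,L_\odot$ the range is roughly 1 to 4 $M_\odot$ yr$^{-1}$, which brackets the value of about 2 quoted above, and a mass-to-luminosity ratio of a few hundredths gives timescales of order $10^8$ yr.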

Since most of the bolometric light comes from massive 
stars, the above ratios of SFR to $L_{\rm bol}$ could be overestimates
if low-mass stars are not forming. For example, in the models
discussed, stars above 10 $M_\odot$ contribute 90% of $L_{\rm bol}$, but stars
below 1 $M_\odot$ account for 80% of the mass formed into stars.
There is no reason to suspect a lack of low-mass stars, how- 
ever, especially since low-mass protostars (T Tauri stars) are 
associated with dark clouds in the Milky Way. 

These very dusty galaxies with intense star formation lead 
again to the question of what truly young galaxies would look 
like. In the absence of dust they would be very blue, like the 
young models in Figure 18, but it seems that the regions 
of galaxies with the most intense bursts are the very red- 
dened ones just described. This question is important in the 
context of primeval galaxies at large redshifts, i.e. the early 
stages of most normal galaxies that now have ages ~10 Gyr. 
Starting with the ideas of Partridge & Peebles (1967), most 
models for primeval galaxies have assumed that hot young 
stars would be visible and lead to detectable luminosities 
(despite the great distances) during a brilliant early burst 
of star formation (e.g. Meier 1976; Kaufman & Thuan 1977; 
Sunyaev et al. 1978). A model for a very dusty primeval 
galaxy has been studied by Kaufman (1976), and Sunyaev 
et al. (1978) considered the possibility of substantial radia- 
tion from dust. If the very dusty nearby galaxies are the best 
analogs of truly primeval systems, as suggested by Larson 
(1976b), their prospects for detection at optical wavelengths 
are dim. Observational searches have so far produced null 
results. 

Another factor making primeval galaxies hard to detect 
could be that most galaxies have such a long timescale for star 
formation that there is not a very bright phase at an early 
peak SFR (Tinsley 1978a; Tinsley & Larson 1979). One argu- 
ment is that most spiral galaxies have colors that correspond 
to rather long timescales for star formation (Section 7.1), pre- 
cluding a significant early burst with a corresponding peak 
in luminosity. Moreover, even elliptical galaxies could form 
their stars over rather long time intervals if star formation 
occurs during a series of mergers among subsystems (Section
5.3.2). According to these ideas, late-type galaxies could
have been fainter in the past than they are now, and early-type
galaxies could have experienced their brightest evolutionary stages
only a few Gyr ago. Galaxies in interestingly early evolutionary
stages have indeed already been found at moderate redshifts:
distant clusters have excess numbers of blue galaxies (Sec- 
tion 6.2.7; Butcher & Oemler 1978a), and counts in the field 
show a large excess of blue galaxies at faint apparent magni- 
tudes (Kron 1978). Further studies of these phenomena will 
surely shed light on the ways in which stars and galaxies have 
formed during cosmological time. 


8 CONCLUSION 


Returning to the outline of galactic evolution in Figure 1, 
one can see how much remains to be learned before the jig- 
saw puzzle will be complete enough for a clear picture to 
emerge. Essentially every aspect of the subject needs further 
observational and theoretical study, so galactic evolution will 
long be a fertile field for research. 